Chapter 1 


Perturbation theory 


© B. Zwiebach 


It is often the case that the Hamiltonian of a system differs slightly from a Hamiltonian 
that is well studied and completely understood. This is a situation where perturbation the- 
ory can be useful. Perturbation theory allows us to make statements about the Hamiltonian 
of the system using what we know about the well studied Hamiltonian. 

The well studied Hamiltonian could be the that of the simple harmonic oscillator in one, 
two, or three dimensions. In a diatomic molecule, for example, the potential that controls 
the vibrations is not exactly quadratic; it has extra terms that make the vibrations slightly 
anharmonic. In that situation the extra terms in the potential represent perturbations of the 
Hamiltonian. The hydrogen atom Hamiltonian is also a well understood system. If we place 
the atom inside a weak external magnetic field or electric field, the situation is described by 
adding some small terms to the hydrogen Hamiltonian. Similarly, the interaction between 
the magnetic moments of the proton and the electron can be incorporated by modifying the 
original hydrogen atom Hamiltonian. The interaction between two neutral hydrogen atoms 
at a distance, leading to the van der Waals force can be studied in perturbation theory by 
thinking of the two atoms as electric dipoles. 


The Hamiltonian of interest is written as the understood, original Hamiltonian H, 
plus a perturbation 6H: 
HO + 6H. (1.0.1) 


Since H©) is Hermitian and the sum must be a Hermitian Hamiltonian, the perturbation 
operator 6H must also be Hermitian. It is convenient to introduce a unit-free constant 
A € [0,1] and to consider, instead, a \-dependent Hamiltonian H(A) that takes the form 


H(A) = HO +.6H. (1.0.2) 


When A = 1 we have the Hamiltonian of interest, but A allows us to consider a family of 
Hamiltonians that interpolate from H©), when » is equal to zero, to the Hamiltonian of 
interest for A equal to one. In many cases perturbations can be turned on and off; think, 
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for example, of an atom in an external magnetic field that can be varied continuously. In 
that case we can view \ as the parameter that allows us to turn on the perturbation by 
letting A ~ 0. The parameter is also useful in organizing the perturbation analysis, as we 
will see below. 

We spoke of a Hamiltonian that differs slightly from H). In order to use perturbation 
theory we need ASH to be a ‘small’ perturbation of the Hamiltonian H). We will have to 
deal with the meaning of small. At first sight we may imagine that small means that, viewed 
as matrices, the largest entries in AOH are smaller than the largest entries in H©). While 
this is necessary, more is needed, as we will see in our analysis. An additional advantage of 
using A is that by taking it to be sufficiently small we can surely make AdH small. 


We assume that the Hamiltonian H) is understood, namely, we know the eigenstates 
and eigenvalues of H(). We want to know the eigenstates and eigenvalues of H (A). One may 
be able to calculate those exactly, but this is seldom a realistic possibility. Diagonalizing 6H 
is seldom useful, since 6H and H do not generally commute and therefore 6H eigenstates are 
not eigenstates of H(A). In perturbation theory the key assumption is that the eigenvalues 
and eigenvectors of H(A) can be found as series expansions in powers of A. We hope, of 
course, that there are some values of for which the series converges, or at least gives useful 
information. 


E(x)\ pene 
a 
geese SS ts jo 


X 


Figure 1.1: The energy eigenvalues of H(A) as varies from zero to one. On the \ = 0 vertical 
axis the H) eigenstates are represented by heavy dots. By the time \ = 1 the dots have shifted. 


In Figure 1.1 we illustrate some of the phenomena that we may see in the spectrum 
of a system with Hamiltonian H(X). We show how the energies of the various states may 
change as the parameter \ is increased from zero. The two lowest energy eigenstates are 
non-degenerate and their energies can go up and down as X varies. Next up in energy we 
have two degenerate states of H) (the Hamiltonian as \ = 0). The perturbation splits the 
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two levels, and that happens generically. In the figure, the perturbation splits the levels to 
first order in A, as shown by the different slopes of the two curves that meet at A = 0. In 
other words, viewed as power series in » the energies of the two states have different linear 
terms in A. The last level shown corresponds to four degenerate states. The perturbation 
to first order in X splits the states into a group of three states and a fourth. To second order 
in A the three states split further. A single Hamiltonian can exhibit behavior like this, with 
many possible variations. 

To analyze the evolution of states and energies as functions of A we have two possible 
cases: (i) we are following a non-degenerate state or, (i7) we are following a collection of 
degenerate states. The challenges are quite different and therefore we must analyze them 
separately. Clearly both situations can occur for a single Hamiltonian, depending on the 
spectrum of H©). To follow a non-degenerate state we use non-degenerate perturbation 
theory. To follow a set of degenerate states we use degenerate perturbation theory. Since 
Hamiltonians H© generally have both non-degenerate and degenerate states we need to 
consider both types of perturbation theory. We begin with non-degenerate perturbation 
theory. 


1.1 Nondegenerate perturbation theory 


We begin by describing the original Hamiltonian H). We assume this Hamiltonian has a 
discrete spectrum with an orthonormal basis |kO)) of energy eigenstates, where k € Zisa 
label that ranges over a possibly infinite set of values: 


HO |KO) = BORO), (KOU) = Sy. (1.1.1) 


We will let & = 0 denote the ground state and we order the states so that the energies 
generally increase as the value of the label increases, so that 


0) 


BO) < BO-< BO) 2 BO Sous (1.1.2) 


The equal signs are needed because some states may be degenerate. 


In this section we focus on a non-degenerate state |n)) with fixed n. This means that 
|n()) is a single state that is separated by some finite energy from all the states with more 
energy and from all the states with less energy. In other words the following must be part 
of the sequence of inequalities in (1.1.2) 


i IY, POY iB) EN (1.1.3) 


If the chosen state is the ground state, we have instead EO < EO. 


As the perturbation is turned on by making X different from zero, the energy eigenstate 
In) of H© will evolve into some energy eigenstate |n), of H(A) with energy E,(A): 


H(A)|n), = En(A)|n)a, (1.1.4) 


4 CHAPTER 1. TIME INDEPENDENT PERTURBATION THEORY 


where 


In),-0 = |r), and E,(A=0) = BO. (1.1.5) 


As we said, the solution is assumed to take the form of a regular power series expansion 
in A. To make this clear consider a function f(A) such that its derivatives to all orders exist 
for A = 0. In that case we have a Taylor expansion 


fvY=>) ~F(0) NM = fFOO)+HOA+4E/ ON +4") AB +-- (1.1.6) 
n=0 ~ 


The expansion is a power series in A, with coefficients f(0), f’(0), etc, that are \ independent 
and reflect the value of the function and its derivatives at \ = 0. 
For our problem we note the values of |n), and E,(A) for A = 0 (1.1.5) and write: 


|n), = \n)) we An) at: 2 In2)) se r3)n@)) Hess 
(L17) 
E,(A) = B® +\EO +2E@ +3 28 +... 


n 


The superscripts on the states and energies denote the power of » that accompanies them 
in the above expressions. The above equations are a natural assumption; they state that 
the perturbed states and energies, being functions of A, admit a Taylor expansion around 
A = 0. Our aim is to calculate the states 


jn), [n), jn®),... (1.1.8) 
and the energies 
BO, SOs CB sae (1.1.9) 


Note that all these states and energies are, by definition, \ independent. Here |n)) is the 
leading correction to the state |r) as we turn on A, and EY) 
the energy as we turn on \. We will not impose the requirement that |r), is normalized. It 
suffices that |r), is normalizable, which it will be for sufficiently small perturbations. For 
\ = 1 we would find the solution for H(1) = H© + 6H in the form 


is the leading correction to 


|n) = |n)1 = |r) + |r) + [n) + |n™) +..., 
(1.1.10) 
Be) Ee) ee OO As Oe ON ae ON Sa 


Substituting the ansatz (1.1.7) into the Schrédinger equation (1.1.4) we will find the 
conditions for such solution to exist: 


(H© + 5H — E,(A))|n), = 0, (1.1.11) 
which more explicitly takes the form 


(Hz ~ EO) — (BO — 5H) — 2B — 386) —...— * BH 4. ; 
(1.1.12) 
(In) + An) + 2|Jn@) 4.3|n@) 4.0.4 Fin) +. .) a0 
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Multiplying out we get a series in A with coefficients A-independent vectors in the state 
space of the theory. If this is to vanish for all values of A those coefficients must be zero. 
Collecting the coefficients for each power of \ we 


EE) |) , 


n 


+ EQ) |n®) + BO nl), 


— 6H)|n@-Y) 4 BO) In) 400 + BO [n©), 


(1.1.13) 
Each equation is the condition that the coefficient multiplying the power of A indicated to 
the left vanishes. That power is reflected as the sum of superscripts on each term, counting 
OH as having superscript one. This gives a simple consistency check on our equations. These 
are equations for the kets |n@),|n()),... as well as the energy corrections BY) EQ) ths 
Note again that \ does not enter into the equations, and thus the kets and energy corrections 
are independent. 
The first equation, corresponding to °, is satisfied by construction. The second equa- 
tion, corresponding to \!, should allow us to solve for the first correction |n“)) to the state 


) 


and the first correction Eo to the energy. Once these are known, the equation corre- 


sponding to A? involves only the unknowns |n?)) and Ee), and should determine them. At 
each stage each equation has only two unknowns: a state correction |n(*)\ and an energy 


correction EM) ; 


A useful choice. We now claim that without loss of generality we can assume that all the 
state corrections |n*), with k > 1 contain no vector along |n). Explicitly: 


(1.1.14) 


To show this we explain how we can manipulate a solution that does not have this 
property into one that does. Suppose you have solution in which the state corrections |n*)) 
have components along |n()): 


In) = In)’ — apn), k>1, (1.1.15) 


with some constants a; and with |n))’ orthogonal to |n)). Then the solution for the full 
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corrected state is 


Pda = I) +9 (hy! — ann) 42% (hn) — cain) Fae 
= (1— ay — apd? —...)[n) + Am + 7[@y +... - 


Since this is an eigenstate of the Hamiltonian H(A), it will still be an eigenstate if we change 
its normalization by dividing it by any function of A. Dividing by the coefficient of |n()) 
we have the physically identical solution |n)\ given by 


1 
t= |,O) 4 yy Oa 21, QV 117 
Pr = In) + nN +n] (1.1.17) 
We can expand the denominator so that we get 
In) = (nO) + AjWMy! + A? (\n)! + ay|[n)’) +... (1.1.18) 


The explicit expressions do not matter, the key point, actually visible in (1.1.17), is that 
we have a physically identical solution of the same equation in which the state corrections 
are all orthogonal to |n()). This shows that we can impose the conditions (1.1.14) without 
loss of generality. 


Solving the equations. Let us finally begin solving equations (1.1.13). For this we note 
that the Schrodinger equation for the ket |n)) implies that for the bra we have 


(n|(HO — BO) =0. (1.1.19) 


This means that acting with (n| on the left-hand side of any of the equations in (1.1.13) 
will give zero. Consistency requires that acting with (n| on the right-hand side of any of 
the equations in (1.1.13) also give zero, and presumably some interesting information. For 
the A-equation this gives: 

0 = (n©\(EY — 6H)\n). (1.1.20) 


Since |n) is normalized and BY isa number, this means that 
EY = (n |6H|n©). (1.1.21) 


This is the most famous result in perturbation theory: the first correction to the energy of 
an energy eigenstate is simply the expectation value of the correction to the Hamiltonian 
in the uncorrected state. You need not know the correction to the state to determine the 
first correction to the energy! Note that the hermicity of 6H implies the required reality of 
the energy correction. 


We can actually find some interesting formulae (but not yet fully explicit!) for the higher 
energy corrections. For the \? equation, acting with (n| on the right-hand side gives 


0 = (nO |((E — 6H)|n) + B®) jn). (1.1.22) 
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Recalling our orthogonality assumption, we have (n©|n“™) = 0 and the term with ED 
drops out. We get 
E2) = (nO6H|n®) , (1.1.23) 


which states that the second correction to the energy is determined if we have the first 
correction |n“)) to the state. Note that this expression is not explicit enough to make it 
manifest that E®) is real. This and the earlier result for the first correction to the energy 
have a simple generalization. Acting with (n©)| on the last equation of (1.1.13) we get 


= (n |( (EY — §H)|In®-Y) 4 B® |In&—) 4... 4 EO in)) (1.1.24) 


Using the orthogonality of |n()) and all the state corrections, we have 
0 = —(n|6H)|\nk-!) + EB), (1.1.25) 


and therefore we have 
E®) = (n©| 6H |n&-), (1.1.26) 


n 


At any stage of the recursive solution, the energy at a fixed order is known if the state 
correction is known to previous order. So it is time to calculate the corrections to the 
states! 


Let us solve for the first correction |n“)) to the state. This state must be some particular 
superposition of the original energy eigenstates |KO), For this we look at the equation 


(H — BO) nM) = (BY — 5H)\n). (1.1.27) 


This is a vector equation: the left-hand side vector set equal to the right-hand side vector. 
As in any vector equation, we can check it using a basis set of vectors. Forming the inner 
product of each and every basis vector with both the left-hand side and the right-hand side, 
we must get equal numbers. We already acted on the above equation with (n| to figure 


out EY. The remaining information in this equation can be obtained by acting with all 
the states (k| with k 4 n: 


(kK |(H© — EO) (2M) = (kO|( EO — 6H)\n). (1.1.28) 


On the left-hand side we can let H©) act on the bra. On the right-hand side we note that 
with k £n the term with EY vanishes 


(EO — EO) (k(n) = —(k|sH|n©). (1.1.29) 
To simplify notation we define the matrix elements of 6H in the original basis 


SHimn = (m|6H|n), (1.1.30) 
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Note that the Hermiticity of 6H implies that 
OMA SO ed (1.1.31) 

With this notation, equation (1.1.29) gives 

OAK 


(KO [n) = - ee, 
BO = EO) 


kén. (1.1.32) 
Since we now know the overlap of |n“)) with all basis states, this means that the state has 
been determined. Indeed we can use the completeness of the basis to write 


Jn) = SO} (KD) = SJR) Of, (1.1.33) 
k kAn 

since the term with & = n does not contribute because of the orthogonality assumption. 

Using the overlaps (1.1.32) we now get 


(0) \5 Hn 


(1.1.34) 


This shows that the first correction |n()) can have components along all basis states, except 
jn). The component along a state |k)) vanishes if the perturbation dH does not couple 
In) to |k), namely, if Hn vanishes. Note that the assumption of non-degeneracy is 
needed here. We are summing over all states |k©)) that are not |n) and if any such state 
has the same H) energy as |) the energy denominator will vanish causing trouble! 


Now that we have the first order correction to the states we can compute the second 
order correction to the energy. Using (1.1.23) we have 


(0) (0) 
Be) = Opry = 2S ie (1.1.35) 
EO — pO 
kAn k n 
In the last numerator we have (n |5H|k©) = 6Hnx = (6Hgn)* and therefore 
OH Kn|? 
EB) =-Y 0 Hen| (1.1.36) 
n 0 0 
ign By — Bn 


This is the second-order energy correction. This explicit formula makes the reality of EY) 
manifest. 


In summary, going back to (1.1.7), we have that the states and energies for H(A) = 
H©) + \6H are, to this order, 


(1.1.37) 


PO) SENT A 


n 
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Remarks: 


1. The first order corrected energy of the (non-degenerate) ground state overstates the 
true exact ground state energy. To see this consider the first order corrected ground 
state energy Eo + LEO, Writing this in terms of expectation values, with \9()) 
denoting the unperturbed ground state, we have 


EO 4B = (0H) + (azo) 
= (0)(HO + r6H)\0) (1.1.38) 
(0 |H(A)|0). 


By the variational principle, the expectation value of the Hamiltonian on an arbitrary 
(normalized) state is larger than the ground state energy F(A), therefore 


BO 4 BO = (0 /H(A)J0) > Ep(r), (1.1.39) 


which is what we wanted to prove. Given this overestimate at first order, the second 
order correction to the ground state energy is always negative. Indeed, 


2 Pel 
Ss Bu BO — , (1.1.40) 
k#0 


and each term is negative because the unperturbed excited state energies Be (k # 0) 

exceed the unperturbed ground state energy Ee ) 

2. The second order correction to the energy of the |) eigenstate exhibits level repul- 
sion: the levels with k > n push the state down and the levels with k < n push the 
state up. Indeed, 


eos atta ao = -¥D a tees — (1.1.41) 


0 
a k>n Et 


The first term gives the negative contribution from the higher energy states and the 
second term gives the contribution from the lower energy states (see Figure 1.2). 


The systematics of solving the equations is now apparent. For each equation we take 
inner products with all states in the state space. That gives the full content of the equation. 
We first take the inner product with (n©|, as this makes the left-hand side equal to zero 
and is thus simpler. Then we take the inner product with (k| with all k 4 n and that 
gives the remainder of the information that is contained in the equation. 


Exercise 1. Calculate |n(?)) and B®), 
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Figure 1.2: The second order corrections to the energy of the state |n()) receives negative contri- 
butions from the higher energy states and positive contributions from the lower energy states. We 
have, effectively, a repulsion preventing the state |n()) from approaching the neighboring states. 


Exercise 2. The state |n), is not normalized. Use (1.1.37) to calculate to order A? the 
quantity Z,,(A) defined by 


1 

——- = ; 1.1.42 

Z,(0) a(n|n) ( ) 
What is the probability that the state |n), will be observed to be along its unperturbed 
version |n(°))? 


1.1.1 Validity of the perturbation expansion 


We now return to a question we did not address: What do we mean when we say that 
AOH is small? We have said that 46H must be small compared to the original Hamiltonian 
H©), but it is not clear what this means, as both expressions are operators. For some 
insight into this matter consider an example where H) is a two-by-two diagonal matrix 
with non-degenerate eigenvalues 


(0) 
H® = (" | ; (1.1.43) 
me 


The perturbation, called AV, only has off-diagonal elements so that 


BO yy 


1.1.44 
Ave Eo ( ) 


H() = H+4AV = ( 
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In this simple example there is no need to use perturbation theory since the eigenvalues, 
FE, and E_, can be calculated exactly as functions of 


Ex(d) = (BE) + #9?) + 


(1.1.45) 


} [2 


Figure 1.3: The Taylor expansion of the function f(z) = V1-+ z? about z = 0 has a radius of 
convergence equal to one. 


root in powers of . To perform this expansion we need the result 


2 4 6 5 


fizz VIF 2 =14 5-454 8+ OC"). (1.1.46) 


The function f(z) exhibits branch cuts at z = +7 (see Figure 1.3), thus the expansion 
of f(z) around z = 0 has radius of convergence equal to one: the series converges for |z| < 1 
and diverges for |z| > 1. Table 1 shows f(z) evaluated for z = 0.9, 1.2, and 1.5. The various 
approximations to the full series are shown. 

For our expansion of (1.1.45), convergence for |z| < 1 implies convergence when 


A\|V 
aA Ay ae AV] < 2 BO — 2), (1.1.47) 
1) (0) (0) a 2 

3|Ey — Ey" 
For |AV| > 1) 0) — EO) the perturbation series does not converge. We learn that for 
convergence the perturbation must be small compared with energy differences in H©. It 
is not sufficient that the magnitude of the matrix elements of 40H be small compared to 
those in H), energy differences matter. This result leads us to expect complications when 
energy differences go to zero and H) has degeneracies. 
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z 0.9 19 1.5 
f(z) 1.34536 1.56205 1.80278 
fs(z) 1.33939 1.47946 1.20297 

(z) 1.33939 1.67280 4.82288 
foo(z) 1.34490 1.36568  -18.4895 
(z) 1.34545 2.23047 641.772 


Table 1.1: f(z) = V1+ 22 =o, az! and f(z) = Diy cz’. 


1.1.2 Example: Anharmonic oscillator 


Consider the simple harmonic oscillator 


aD 
Pp 1 22 
H® = 5 4 line? . 1.1.48 
2m 72 ( ) 
We want to explore the effect of a perturbation proportional to ~ #+. This has the effect 
of changing the original quadratic potential for a more complicated potential that includes 
a quartic term. To do analysis in a clear way, we must consider units. Using the constants 
h,m,w of the harmonic oscillator a length scale d can be uniquely build: 
h 
CS (1.1.49) 
mu 
The unit-free coordinate </d then has a simple expression in terms of creation and annihi- 
lation operators: 


hi 
(@t+al)=d@+@) 


fies = a aly: 1.1.50 
P45 (a + a) (1.1.50) 


It follows that an @* perturbation with units of energy takes the form 


a4 m2? on 


rs thw(ataly*, (1.1.51) 


Using the unit-free parameter A, the perturbed Hamiltonian will therefore be 


We will identify the states |k©) of H© with the number eigenstates |k), k = 0,1,..., of 
the harmonic oscillator. Recall that 


BO = hw(k+4), |k) = |0). (1.1.53) 
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The Hamiltonian H(A) defines an anharmonic oscillator. In a classical anharmonic oscil- 
lator the frequency of oscillation depends on the amplitude of oscillation. In the quantum 
harmonic oscillator all levels are equally spaced. The frequencies associated with transi- 
tions between various levels are therefore integer multiples (7.e. harmonics) of the basic 
frequency associated to a transition between the first excited state and the ground state. 
In the quantum anharmonic oscillator the spacing between the energy levels is not uniform. 


First the simplest question: What is the first-order correction EY to the energy of the 


ground state? For this, following (1.1.21) we simply calculate the expectation value of the 
perturbation on the ground state: 


Bh) = (04 hw(a+aty'}oy = brw(O\(@+at)'|o)= hw, (1.1.54) 


where we used 
(0|(a + a')*|0) =3, (1.1.55) 


as you should verify. It follows that the corrected energy is 
Fo() = BO +E +002) = Siw + ABhw + O(A?) = hw (1+ 24+ O(02)) (1.1.56) 


We note that the energy of the ground state increases with \ > 0. This is reasonable as the 
quartic term in the modified potential squeezes the ground state. How about second order 
correction to the ground state energy? For this we use (1.1.36) taking n = 0: 


(2) |5 Axo|? 
kA0 EK = Ey 


The sum is over all & > 1 such that 6H;z9 is non-vanishing. Here 
6H = thw (k|(a+ at)*|0) . (1.1.58) 


We consider (@ + @')4|0) which corresponds, up to constants to acting on the ground state 
wavefunction yo with 2*. This should give an even wavefunction. So (@ + 4')4|0) must 
be a superposition of |0),|2), and |4). We cannot get states with higher number because 
there are at most four creation operators acting on the vacuum. A short calculation (do 
it!) confirms that 


(a+ a')4|0) = 3]0) + 6V2|2) + V4!|4). (1.1.59) 
This immediately gives 
6Hoo = hw, 6H = 382 tw, 6H = YB hw, (1.1.60) 


the first of which we had already determined and is not needed for the second order com- 
putation. Back in (1.1.57) we have 
BO = —|6Hao|? — |OHaol? _ (Fw)? 


nen) Aw Phu 
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Therefor the corrected ground state energy to quadratic order is 


Eo) 4 Eo) +H?) = le (i 8K 2EY’Y (1.1.62) 


The computation can be carried to higher order, as first done by Bender and Wu (Phys. 
Rev.184 (1969)1231). They find that! 
Eo(A) = hw (1+ 3A — 2h? + 388 8 — 30885 yA 4 OESTOT YP — Sots ord + O(A')) 
(1.1.63) 
As it turns out the coefficients keep growing and the series does not converge for any nonzero 
A; the radius of convergence is actually zero! This does not mean the series is not useful. 
It is an asymptotic expansion. This means that for a given small value of A the magnitude 
of successive terms generally decrease until, at some point, they start growing again. A 
good approximation to the desired answer is obtained by including only the part of the sum 
where the terms are decreasing. 


Exercise 3. Calculate the first order correction BY to the energy for the state |n) of 


number n. Exhibit the anharmonicity of the oscillator by using this result to find, to first 
order in A, the energy separation AF,(A) = En(A) — En—1(A) between levels. 


Let us now find the first order correction to the ground-state wavefunction. Using 
(1.1.34) with n = 0 we have 


bd Ho 
k40 EX _ Es 
We then find 
OH: OH. 
jo) = — 2) - orld) = —§v 212) — ievalld) 


Aho (1.1.65) 


= —3@lallo) -daatata'|o). 


This means that to first order the ground state of the perturbed oscillator is 


0) — A (3 atatloy +4 atatatat|o)) + 0(A2). (1.1.66) 


1.2 Degenerate perturbation theory 


If the spectrum of H) has degenerate states, as shown in Figure 1.1, tracking the evolution 
of those states as A becomes nonzero presents new challenges. We first show that naive 
extrapolation of our results for a non-degenerate state do not work. We will also be able to 
appreciate the basic difficulty. 


‘Bender and Wu’s results, in eqns. (2.12) of their paper must all be multiplied by a factor of 2, as they 
take Ao = 1/2. 
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1.2.1 Degenerate toy model 


Consider an example with two-by-two matrices. The unperturbed matrix H©) will be set 


equal to the identity matrix: 
HO = £ ; (1.2.1) 


We have a degeneracy here as the two eigenvalues are identical (and equal to one). The 
perturbation matrix 6H is chosen to be off diagonal: 


= € q (1.2.2) 


H(A) = HO + 6H = € i) (1.2.3) 


We then have 


x 


Using labels n = 1,2 the unperturbed eigenstates can be taken to be 


fi) == @ t. 2S @ , EO = BO <1, (1.2.4) 


with the corresponding eigenvalues indicated as well. To first order in A, the eigenvalues 
predicted from non-degenerate perturbation theory (1.1.37) are En(A) = BO + bina. 


This gives 
BOY. S206 = 10S 1? 


: (1.2.5) 
E\(\) = BO +)6H» = 14+-0=1? 


The eigenvalues are unperturbed to first order in since the matrix 6H is off-diagonal. 
These answers, however, are wrong. We can compute the exact eigenvalues of H(A) and 
they are 1+. There is also a problem with the state corrections. Equation (1.1.37) states 


that oat 
In), = Be oes tl o_o (0), (1.2.6) 


but this time, with EO = Eo the denominator is zero, and the 6H matrix element is also 
zero, giving us an ambiguous result. 
So what can we do? A direct calculation shows that 


1 
H(A) has eigenvectors —= () with eigenvalue =1+ 4, (237) 
v2\1 
and : ( ; ) with eigenvalue = 1— (1.2.8) 
het g : 2: 


You may think the eigenvectors jump from those of H() indicated in (1.2.4) to those of 
H(X) as soon as X becomes nonzero. Such discontinuity is totally against the spirit of 


16 CHAPTER 1. TIME INDEPENDENT PERTURBATION THEORY 


perturbation theory. Happily, this is not really true. The eigenvectors of H©) are in fact 
ambiguous, precisely due to the degeneracy. The eigenvectors of H©) are actually the span 
of the two vectors listed in (1.2.4). The perturbation selected a particular combination 
of these eigenvectors. This particular combination is the one that we should use even for 
» = 0. The lesson is that to get states that vary continuously as A is turned on we must 
choose the basis in the degenerate subspace of H) carefully. We will call that carefully 
selected basis the “good” basis. 


1.2.2 Systematic Analysis 
Again, we are looking at the perturbed Hamiltonian 
H(\) = H +H, (1.2.9) 


where H\) has known eigenvectors and eigenvalues. We will focus this time on a degenerate 
subspace of eigenvectors of dimension N > 1, that is, a space with N linearly independent 
eigenstates of the same energy. In the basis of eigenstates, H) is a diagonal matrix that 
contains a string of N > 1 identical entries: 


H® =diag{ BO, BO ,..., BO ,..., BO, ...}. (1.2.10) 


In the degenerate subspace we choose a collection of N orthonormal eigenstates 
[rn 1) ,|n 52), 22, nO: NY). (1.2.11) 


Accordingly, we have 


(2 - |r 1) = Ones (i212) 
HOMO: ky = EO\n©:k). (1.2.13) 


This set of vectors span a degenerate subspace of dimension N that we will call Vy 
Vn = span{|n©; Rik = ya Vb (1.2.14) 
The total state space of the theory, denoted by H is written as a direct sum: 
H=VnevV, (1.2.15) 


where V is spanned by those eigenstates of H that are not in Vy. We denote by |p) 
with p € Za basis for V. That basis may include both degenerate and non degenerate 
states. Together with the states in Vy we have an orthonormal basis for the whole state 
space: 

(p |g) = Spa, (p)|n©:k) = 0. (1.2.16) 


1.2. DEGENERATE PERTURBATION THEORY 17 


Our notation distinguishes the states in Vy from those in V because the former have two 
labels and the latter have only one. 


We now consider the evolution of the degenerate eigenstates as we turn on the pertur- 
bation. Again we assume that the states vary continuously in \ and thus write: 


[ns k) + [ns k), = [ns k) + dln; b) + 7[n; k) + OC), 


(1.2.17) 
EO + Eyg(d) = EO +ER2 + EO) +003). 
These equations hold for k = 1,...,N. Note that for each value of k the energy corrections 


might be different and that’s why the energy corrections carry the label k. Our goal is to 
find the state corrections |n(): k) and the energy corrections B® H for p > 1 and for each k. 


As before we demand that |n);k) for p > 1 has no component along |n; k), ice. 
(n: kin: k) =0 for po 1 (1.2.18) 


Note, however, that |n®);k) may have components along |n©; ¢) with € 4 k. So |n©: k) 
may and in fact will have a component in Vy. 
The perturbed eigenstates must satisfy 


F(A) |n;k)y, = Enp(A)|nzk)y, (1.2.19) 


and substituting the perturbative expansions above we obtain equations completely analo- 
gous to the ones in Eq.(1.1.13) 


Mx: (H° — EB) |n©;k) = 0, (1.2.20) 
Ab: (H°— B®) (nk) = (EO) — 6H)|n©; by, (1.291) 
2 (H°— BO) |n®sk) = (BE) — 6) In; k) + BP? nO; k), (1.2.22) 


In the following we will discuss a solution to first order for the case in which the de- 
generacy in Vy is completely broken to first order in perturbation theory; that is, the first 
order corrections to the energies split the N states completely. Our solution will proceed 
in three steps: 


1. Hit the O(A) equation with (n); é| to learn that 6H must be diagonal in the chosen 
basis for Vy and to determine the first-order energy shifts. 


2. Use the O(\) equation to calculate the components of |n“);k) in V. 


3. Hit the O(A2) equation with (n; ¢| to determine the second order energy correction 
jes and the component of |n“;k) in Vy. 
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Step 1. Recalling that (n; ¢\(H© — B®) = 0, as we hit the O(\) equation with (n; ¢| 
the left-hand side vanishes and we find 


(n ; e\(E} — 6H)\n© ;k) =0. (1.2.23) 
Since the basis states in Vy are orthonormal, this implies that 
(n 0) 6H |n:k) = ae O¢,k « (1.2.24) 


This equation holds for all k,@=1,...,N. Remarkably, this equation is telling us that the 
basis |n;k) must be chosen to make the matrix 6H diagonal in the subspace Vy! This 
is required in order to get the perturbation theory going. Setting @ equal to k we read the 
values of the first order energy shifts 


ELL = (n; k6H|In;k) = SH aknks (1.2.25) 


where the last equality is a definition. The energies to first order are then 
Enk(d) = EQ) + 6H nknk- (1.2.26) 


A few remarks: 


1. The above result for the first order energy shifts is true always, even if the degeneracy 
is not lifted. The degeneracy is lifted when 


BY x EY), whenever k # £, (2.27) 


n, 
for all values of k,@ = 1,...,N. This assumption will be used in the later steps. If 
the degeneracy is lifted, the basis states \n. k) that make 6H diagonal in Vy are 
called “good states” or a “good basis”. This means that they are the basis states in 
Vy that get deformed continuously as 4 becomes non-zero. If the degeneracy is not 
lifted to first order the determination of the good basis has to be attempted to second 
order. 


2. The perturbation 6H is diagonalized in the subspace Vy. The perturbation 6H is 
not diagonal on the whole space H, only within the block representing Vy is 6H a 
diagonal matrix. Alternatively we can see this via the action of 6H on the basis states. 
Introducing a resolution of the identity, we have 


6H [nr 2) = Son 54) (n©; ql6H[n©; 2) + So |p©) © [6H|n©; &) 
q Pp 


= Boe gin sa) + > [p) (pO [SH no (1.2.28) 
q Pp 
= E%}in; &) + 7p) PO |sH|n; 2). 
Pp 


This shows that the states |n @) are almost 6H eigenstates with eigenvalues equal 
to the first order energy corrections. The failure is an extra state along V. 
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3. We can sometimes assess without computation that a certain basis in Vy makes 6H 
diagonal. Here is arule: the matrix 6H is diagonal for a choice of basis in Vn if there 
is a Hermitian operator K that commutes with 6H for which the chosen basis vectors 
are K eigenstates with different eigenvalues. This is quickly established. Consider 
two different basis states in Vy: |n; p) and|n©:; q), with p 4 q. Assume these have 
K eigenvalues , and A,, respectively. Since [5H, K] = 0: 


0 = (n©; pl[6H, K]|n©;q) = Ag—Ap(n©; p| 6H |n© ; g). (1.2.29) 


Since the eigenvalues A, and Ag are presumed to be different, the non-diagonal matrix 
elements of 6H vanish. 


Step 2. The O(A) equation cannot determine the component of |n“;k) along Vy. As 
we will see later, such piece is required by consistency and gets determined from the O()?) 
equation. We now determine the piece of jn“; k) along V. For this we hit the O(X) equation 
with (p | to find 


|p) L Vy 
(|(H — HO )In;4) = (|( BEE — 6H)In;k) (1.2.30) 
and we get 
(ED) — ED) (pn; k) = —bHpnk (1.2.31) 


where we introduced the matrix element 6H, 4% = (p|dH |n;k). Our equation above 
means that the piece of |n“;k) in V is now determined: 


6H. nk (0) 1) 
nO: ky = —S*> Bh 1) 4 nO ky, (1.2.32) 
yy fo a Eo Vn 


where we included explicitly the still undetermined piece of \n@). k) along Vy. 


Step 3. We now hit we hit the O(A?) equation with (n); é|. The left-hand side vanishes 
and using the above expression for |n“);k) we find 

= _ (2 - 0B — (0)) _OFfp nk 

Oi (n CE 6H) ye lps) EO a EO 

p sd i (1.2.33) 


+ (nN, — SE) sk) HERO. 


In the first term on the right-hand side, the part proportional to EY) vanishes by orthonor- 


mality. On the second line, the term including 6H can be simplified because 6H is diagonal 
within Vy. Recalling (1.2.28) we have 


’ 


(nO; 5H = EC) (n; 6) +S \(n; 5H p) (p. (1.2.34) 
Pp 
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The piece in V vector drops out for our case of interest: 


(nO ; 5H |[n™; k) 


= EX} (nen; r)| : (1.2.35) 


Vn N 


Back into equation (1.2.33) we now get 
pa ey ede eae 
Sa at (z EO - Et)) (nen, k)| + Be) bp =0. (1.2.36) 
ED = Ey , Vn ‘ 
Setting € = k we can determine the second correction to the energies: 
(2) |S Hp nkl? 
| hg eee ee eee Zl y (1.2.37) 
ik oy BO 7 EO) 
For k 4 £ we get 


0A ne.p0Hp.nk @ _ pQ) 4,0). an). = 
s- 0 ao 4 (eo) 2 Eo)) (n:¢n;R)] = 0. (1.2.38) 


Had we not included the piece of In. k) along the degenerate subspace we would have 

had an inconsistency, since there is no reason why the first term on the left-hand side must 

be zero. Now the ee equation just fixes the components of \n(); k) in the degenerate 
(1), 

subspace as long as EC ee Ene 


1 OA ne.n6 Hy nk 
(n©: en: k)| = ao 3 OO , Ke. (1.2.39) 
ha = =a eae 
We thus have 
1 Ane p Ay nk 
Inky] = —S~ jn; 2) —_—_ as a ei ALL (1.2.40) 
a dX EO) a ao) Ss EO — EO 


It may seem that this extra piece, found by using the O(A?) equation, is higher order than 
it should in the perturbation: its numerator contains two powers of 6H. But this expression 
also has a curious energy denominator, EO) - EY) , in which each term has a power of 6H. 
All in all, the correction to the state is properly first order in 6H. 


Summarizing our result we have 


Degenerate perturbation theory with degeneracies lifted at O(,): 


oH i ae 6A, ne 
p P lt+k e ee ne Pp 


OAnkp OH 
En,k(A) = Eo) + A6Ank nk _ ost an OO): “a —, OA nk nk - 
Pp DATs en 


(1.2.41) 
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1.2.3. Degeneracy lifted at second order 


We now investigate the case when the degeneracy is completely unbroken to first order. 
The situation and the setup is similar to the one we just considered: we have a degenerate 
subspace Vy of dimension N and the rest of the space is called V. This time, however, we 
will assume that the degeneracy of H) is not broken to first order in the perturbation 6H. 
Concretely, this means that on the Vy basis |n);k) with k = 1,...,.N, we have 

(n; | 6H |r r= (1) Ook « (1.2.42) 
The first order energy correction is the same, and equal to EY for all basis states in Vy. 
You should compare with (1.2.24), where the energy had an extra subscript to distinguish 
its various possible values. 

Because the degeneracy is not broken to first order we do not know at this point what 
is the good basis in Vy. We will consider here the case when the degeneracy is completely 
lifted to second order. We express our ignorance about good basis vectors by stating that 
we are searching for the right linear combinations: 


N 
[p) = > (nk) a. (1.2.43) 
k=1 


For some values of the constants al”) with k = 1,...,N the state |) will be good. We 
can think of a) as the column vector representation of |¢)©)) in Vy. We have written 
just one state, |p), even though we are expecting to find N good states to span the 
degenerate subspace. We therefore adjust the notation to reflect this. We introduce a new 
index J=1,...,N and write 


N 
1p) = S> |n© ;k) a, T=1,...,N. (1.2.44) 
k=1 


The index J now labels the different good states and their different vector representations 


0) 


a\ . Our most immediate goal is to find those vectors al) and thus the good basis. To 


do so we will have to consider second order energy corrections. The states (yw) form an 


orthonormal basis in Vy if the coefficients a) satisfy 


WOW?) =6rr > aR ae = br. (1.2.45) 
k 


We set up the perturbation theory as usual 


eryx = |60) + rp) + r71WP) 4, 


BX) = BO) 48D 2 BO 4 BBY toce. 


n nl 


(1.2.46) 
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Note that in the energy expansion we have accounted for the degeneracy to zeroth and first 
order: the index J first appears in the second-order corrections to the energy. Using the 
Schrodinger equation 


A(A)lbr)a = Enr(A)ler)a, (1.2.47) 
gives the by now familiar equations, of which we list the first four: 
XY: (HO — BO) WP) = 9, 
Mi (HO — BO) WP?) = (BD — 6H)boy”), hate 
1.2.48 
VY: (HO — BO) WP) = (BD —sH)WY?) + Bee bP), 
2 (HO— BO) yP) = (BD — SEP) + BO WP) + BD Py. 


The zero-th order equation is trivially satisfied. For the order A equation the overlap with 
(n; | works out automatically, without giving any new information. Indeed, the left-hand 
side vanishes and we thus get 


0 = (nO; 9(BO — sH)[p). (1.2.49) 


Since (n: ¢| is a OH eigenstate with eigenvalue EY) up to a vector in V (see (1.2.34)), 
the above right-hand side vanishes. Acting on the order A equation with (p| gives useful 
information: 


(BP — EO) WP) = POEL -— 6H) = -@6HW), (1.2.50) 


using the orthogonality of V and Vy. Letting 
bHpr = (p [SHI , (1.2.51) 

we then have 

JH pr 


POW) =F 
I a) = E©) 


(1.2.52) 


Since the ket yw) is still undetermined, it makes sense to write this information about 


(0) 


(yw) in terms of the unknown a, ’ coefficients. We have 


N N 
6Hpr = S-(p 6H |n; kal? = S_ 6H, an al), (1.2.53) 
k=1 k=1 
Back into (1.2.52) we get 
(1) l ~ (0) 
(PO |p) = “Fo pw SOA pnk Gay (1.2.54) 
p “nN k=1 
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This gives the piece of i?) in V in terms of the unknown zeroth order eigenstates. 

We have now extracted all the information from the order \ equation. We look now 
at the order \? equation, which contains the second order corrections to the energy and 
therefore should help us determine the zeroth order good states. We hit that equation with 
(n; €| and we get 


0 = (nO: ¢(BM — 5H") otin®; eB — 6H)|p\) ’ + Bq. (1.2.55) 
N 


Happily, the second term, involving the components of we) along Vy, vanishes because 


(1) 


of the by now familiar property (1.2.28) adapted to this case. The piece with E;,’ on the 
first term also vanishes. We are thus left with 
0 = —(n;e\6H\yy)| + Epr aie - (1.2.56) 


Introducing a resolution of the identity to the immediate right of 6H, only the basis states 
in V contribute and we get 


0 = —So(n gdp (py?) + BD al? , (1.2.57) 
Pp 


where there is no need to copy the |», anymore. Using the result in (1.2.54) we now get 


= 2 FH ne aya oH ree 57 £5, a) (1.2.58) 
Be k=1 


Reordering sums and multiplying by minus one we get 


N 
Ant,p0Hp,n 
5 (-E Saetest ~ su )al? = 0 1.280 
k=1 Pp Ep — En 


To understand better this equation we define the N x N Hermitian matrix M() 


Hnep 6H pm 
My = ->- SHnep OH pink (1.2.60) 


, 0 0 
— A — A 


The equation then becomes 
N 
(Me — EA? ex) ayy = 0. (1.2.61) 
k=1 


Recalling that the Kronecker delta is the matrix representation of the identity, we have 


(me) — E71) a = 0. (1.2.62) 
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(2) 


This is an eigenvalue equation that tells us that the energy corrections E)7 are the eigen- 
values of M®) and the vectors a) are the associated normalized eigenvectors. These 
determine, via (1.2.44), our orthonormal basis of good zeroth order states. If 6H is known, 
the matrix M©) is computable and Hermitian and can therefore be diagonalized. 

We will leave the computation of the component of yw) on the degenerate subspace. 
That can be done if the degeneracy is completely broken to quadratic order (the eigenvalues 
of M() are all different). Still, it takes some effort and one must use the order \? equation. 
Our results so far are 


wr) ODL a A ao te) af),) 400%), 


Ema) = EO +5) me aoe 


|r) a 
(1.2.63) 


Here the all) are still unknown coefficients that determine the component of the first cor- 
rection to the states along the degenerate subspace. If you followed the discussion, all other 
symbols in the above equations have been defined and are computable given 6H. 


(1) 


The answer for the coefficients a LJ turns out to be 


al) _ 1 S- 6H Jp b Hyg dH gr 7 BO) sS- OF Jp bHpr (1 2 64) 
? 2 2 0 0 0 0 0 0)\2 = 
EO — BO) |e (BP) — ED)? — ER) ) _ pO) 
The third order corrections to the energy are 
2 


Chapter 2 


Fine Structure 


© B. Zwiebach 


2.1 Review of hydrogen atom 


The hydrogen atom Hamiltonian is by now familiar to you. You have found the bound state 
spectrum in more than one way and learned about the large degeneracy that exists for all 
states except the ground state. We will call the hydrogen atom Hamiltonian H©) and it is 
given by 


p e 
HO =P =. (2.1.1) 


We take this to be the “known” Hamiltonian, meaning that we know its spectrum. This 
Hamiltonian is supplemented with corrections that can be studied in perturbation theory. 
That study is the subject of this chapter. We begin, however, with some review and 
comments. 

The mass m in H©) is the reduced mass of the electron and proton, which we can 
accurately set equal to the mass of the electron. If one wishes to consider the case of an an 
electron bound to a nucleus with Z protons one lets e? + Ze? in H). The Bohr radius is 
the length scale build from i,m, and e? 

h2 
ag =—s ~ 53pm. 2:12 
0=— 2 P (2.1.2) 
The energy levels are enumerated using a principal quantum number n, an integer that 
must be greater or equal to one: 
e? 1 


E, =-— a Se ts 2.1.3 
y 2a9 n2’ eae ( ) 


Note that H©) is a non-relativistic Hamiltonian: the speed of light does not enter in it, 
and the kinetic term is that of Newtonian mechanics. The energy scale relevant to the 
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bound state spectrum can be better appreciated using the speed of light to consider both 
the fine structure constant and the rest energy of the electron. The fine structure constant 
is given by 


az—ey (2.1.4) 


This states that the energy scale of hydrogen bound states is a factor of a? smaller than 
the rest energy of the electron, that is, about 19000 times smaller. We can thus rewrite the 
possible energies as: 


1 
ie Ree 
Ey = —3 0° me i (2.1.6) 
The typical momentum in the hydrogen atom is 


h me? e? 


~— = — = >SmM ~ a(me), 2.1.7 

Pe. ' - p = a(me) (2.1.7) 

which, written as p ~ m(ac) says that the typical velocity is v ~ ac, which is low enough 
that the non-relativistic approximation is fairly accurate. Finally, we note that 


he hol x 
m = ey es 2.1.8 
" mane mc a a ( ) 
which says that the Bohr radius is over a hundred times bigger than the (reduced) Compton 


wavelength of the electron. 


The degeneracy of the hydrogen atom spectrum is completely quantified by the relation 
n=N+041. (2.1.9) 


Here N > 0 is the degree of a polynomial in r that appears in a wavefunction where the 
leading dependence on r near the origin is factored out. The quantum number ¢ > 0 is the 
angular momentum of the state. For each fixed n, you can see that the number ¢ ranges 
from zero to n — 1. And for each fixed @ the eigenvalue of L, is mh with m ranging from 
—£ up to é: 


WS D se oss £=0,1,...,n—-1 
n-1 
m=—,...,£ # of states with energy E =S-(2e+1) =n? 
£=0 


The states of hydrogen are shown in this energy diagram, which is not drawn to scale, 
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n=2 a ————az 
N=1 N=0 

n=1 ar) 
N=0 


The table features the commonly used notation where capital letters are used to denote 
the various values of the orbital angular momentum ¢. If we have L denote the generic 
capital letter for angular momentum we have L(¢) where 


LCOS) LCL =P). ECH= DAD. BOS3) SF sod (2.1.10) 


Thus, for example, an S state is a state with € = 0, a P state is a state with @=1, anda 
D state is a state with @ = 2. 

Any hydrogen eigenstate specified by the three quantum numbers n, ¢,m, because, as it 
follows from (2.1.9), the value of N is then fixed. The wavefunction takes the form 


r 


£L 
Unt po =A (=) . (Potynomiat in — of degree x) “i. PO Vp OD) 5 Galt) 
ao ao 


where A is a normalization constant and N = n— (€+1). If you look at the wavefunction, 
the value of n can be read from the exponential factor. The value of @ can be read from 
the radial prefactor, or from the spherical harmonic. The value of m can be read from 
the spherical harmonic. For the ground state n = 1, 4 = 0 and m = 0. The normalized 
wavefunction is 


1 pei 
e- %.., (2.1.12) 


~1,0,0(r) = 


Comments: 


1. There are n? degenerate states at any energy level with principal quantum number 
n. This degeneracy explained by the existence of a conserved quantum Runge-Lenz 
vector. For a given n the states with various ¢’s correspond, in the semiclassical 
picture, to orbits of different eccentricity but the same semi-major axis. The orbit 
with @ = 0 is the most eccentric one and the orbit with maximum @ = n — 1 is the 
most circular one. 
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2. For each fixed value of @, the states have increasing N as we move up in energy. The 
number N is the number of nodes in the solution of the radial equation, that’s why 
it is the degree of the polynomial in r that features in the wavefunction (2.1.11). 


3. The analysis of H) so far ignored electron spin. Since the electron is a spin one- 
half particle there is an extra degeneracy: each of the H) eigenstates is really two 
degenerate states, one with the electron spin up and the other with the electron spin 
down. These states are degenerate because H) has no spin dependence. 


4. We will have to supplement H©) with terms that correspond to corrections that arise 
from relativity and from the spin of the electron. This will be the main subject of 
the following analysis. It will determine the fine-structure of the hydrogen atom. The 
corrections will break much of the degeneracy of the spectrum. 


5. In order to understand better the spectrum and the properties of the Hydrogen atom 
one can apply an electric field, leading to the Stark effect or a magnetic field, leading 
to the Zeeman effect. These external fields are represented by extra terms in the 
hydrogen atom Hamiltonian. 


Let us now discuss two different choices of basis states for the hydrogen atom, both of 
which include the electron spin properly. 

Recall that, in general, for a multiplet of angular momentum j, we have states (j,m,), 
with m; running from —j to j in integer steps. All states in the multiplet are J? eigenstates 
with eigenvalue h?j(j +1) and, for each state, hm, is the eigenvalue of ty: 

Because the electron has spin one half, its states are labeled as 


(2.1.13) 


NIK 


Gms) With @= 5% tg 


In the hydrogen atom the angular momentum @ can take different values, but the spin of 
the electron is always one-half. As a result, the label s is often omitted, and we usually only 
record the value of m,. For hydrogen basis states we thus have quantum numbers n, ¢, m¢, 
and ms. To avoid confusion, we have added the @ subscript to me, thus emphasizing that 
this is the azimuthal quantum number for orbital angular momentum. Since we are not 
combining the electron spin to its orbital angular momentum, the states form the “uncoupled 
basis” : 


Uncoupled basis quantum numbers: = (n, 4, mz, Ms) - (2.1.14) 


The states are completely specified by these quantum numbers. As we let those quantum 
numbers run over all possible values we obtain an orthonormal basis of states. 

It is often useful to use an alternative basis where the states are eigenstates of J2 and Jz, 
where the total angular momentum J is obtained by adding the orbital angular momentum 
L to the spin angular momentum 8S: 


J = L+s. (2.1.15) 
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When we form ¢® s we are tensoring a a full @ multiplet to an s multiplet (here, of course, 
s = 1/2). All states in 0 s are eigenstates of L? and eigenstates of S?, so 0 and s are good 
(constant) quantum numbers for all 7 multiplets that arise in the tensor product. Each j 
multiplet has states with quantum numbers (j,m,;). 

The coupled basis is one where states are organized into 7 multiplets. While states are 
no longer L, nor S$, eigenstates they are still L? eigenstates, thus the @ quantum number 
survives. The coupled basis quantum numbers are therefore 


Coupled basis quantum numbers: — (n, £,j,m,;) . (2.1.16) 


The (mg, ms) quantum numbers of the uncoupled basis have been traded for (j,m,;) quan- 
tum numbers and we have kept the n,@ quantum numbers. The coupled states are linear 
combinations of uncoupled states that involve different values of mg and ms, those combi- 
nations that yield the same value of mj; = mg + ms. 

To find the list of coupled basis states we must tensor each € multiplet in the hydrogen 
atom spectrum with the spin doublet 5. The rules of addition of angular momentum imply 
that we find two 7 multiplets: 


é@i = G=l+A OG =0-}). (231.17) 


For £ = 0, we only obtain a 7 = 1/2 multiplet. We use the notation L; for the coupled 
multiplets, with L = S,P,D,F for @ = 0,1,2, and 3 (see (2.1.10). The change of basis is 
summarized by the replacements 


l@z > BU), ge ® TAO) ge (2.1.18) 
or more explicitly, 
08 ; > Si 
1@4 > Ps@Pi 
(2.1.19) 
285 + Ds@Dz 
385 2 FrOFs 
+ multiplet, 


Thus, by the time we combine with electron spin, each ¢ = 0 state gives one j = 2 
each £ = 1 state gives j = 3 and j = $ multiplets, each ¢ = 2 state gives 7 = 5 and j = 3 
multiplets, and so on. For hydrogen, the principal quantum number is placed ahead to 


denote the coupled multiplets by 


Coupled basis notation for multiplets: |nL; (2.1.20) 


Using this notation for coupled basis multiplets the diagram of hydrogen atom energy 
eigenstates becomes: 
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n= 4 Z — 
S 
n=3| 3 3P3 3Ds 
(2) (6) 3P: 10) 3Ds 
2. 
281 
a 2 
(2) (6) 2Pi 
1S, 
2 


The number of states is indicated in parenthesis. 


2.2 The Pauli equation 


In the hydrogen atom the spin-orbit coupling arises because the electron is moving in the 
electric field of the proton. Since the electron is moving relative to the frame where we have 
a static electric field, the electron also sees a magnetic field B. The spin-orbit coupling is 
the coupling —y- B of that magnetic field to the magnetic dipole moment yp of the electron. 
We have discussed before, in the context of the Stern-Gerlach experiment, the value of 
the magnetic dipole moment of the electron. Recall the logic we used. In Gaussian units, 
the classical magnetic moment of a planar current loop is given by pw = fa, where I is the 
current and a is the area vector associated with the loop. From this one quickly derives 
that for a uniformly rotating particle with charge q and mass m the magnetic moment is 


pee (2.2.1) 
2mc 
where L is the angular momentum due to the rotation. For an elementary particle, this 
motivates the following relation between the spin angular momentum operator S and the 
magnetic moment 
qd A 


uM = gS (2.2.2) 
2mc 


where g is a factor that is added to parameterize our ignorance; after all there is no reason 
why the classically motivated formula for the magnetic dipole should hold in the quantum 
domain of spin. As it turns out, for an electron one has g = 2. Since g = —e for an electron, 
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we have 4 

-—e€ «4 eh § eh eh 
S = -2 — = +2 to = 2.2.3 

2WMeC 2mMec h Mec 2 a ( ) 


= 92 = 
# 2MeC 


For numerical applications we note that the Bohr magneton wp is defined by 
eh 


. 
n= ~ 9.274 x 1072! = 67910 tes 
2MeC gauss gauss 


Ze (2.2.4) 


(for SI values use Tesla = 10* gauss). The coupling of an electron to an external magnetic 
field is therefore represented by a Hamiltonian Hg given by 


(2.2.5) 


Our goal now is to show that this coupling, and its associated prediction of g = 2, arises 
naturally from the non-relativistic Pauli equation for an electron. 
Consider first the time-independent Schrodinger equation for a free particle: 


sv = Ey. (2.2.6) 


Since a spin one-half particle has two degrees of freedom, usually assembled into a column 
vector y, the expected equation for a free spin one-half particle is 
p° x 
—yx = Ey with y= ( 7 ; (222.7) 
2m X2 
One sometimes calls x a Pauli spinor. Note there’s an implicit two-by-two identity matrix 
Toyx2 in the Hamiltonian 
<2 
p 
H = —tI1 : 2.2.8 
2m 2x2 ( ) 
We can rewrite this Hamiltonian using Pauli matrices if we recall the identity 


(o -a)(o-b) = a-bllox2+io-(axb), (2.2.9) 


valid for arbitrary vector operators a and b. Taking a = b = p, with p the momentum 
operator, and recognizing that p x p = 0, we have 


(o-p)-(o-p) = Pp? Toxo. (2.2.10) 


This means that the Hamiltonian (2.2.8) can be rewritten as 


H = —(o-p)(o-p). (2.2.11) 
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So far, this is all just rewriting with no change in physics. But new things happen when we 
the particle is charged and we couple it to external electromagnetic fields. The quantum 
mechanical rule is that this inclusion can be taken care with the replacement 


po#we= p-<A. (2.2.12) 
Here gq is the charge of the particle and A is the external vector potential, a function 
of position that becomes an operator A(x) since position is an operator. In addition, if 
there is a electromagnetic scalar potential ® it contributes an additional term g®(x) to the 
Hamiltonian. 
With the replacement (2.2.12) applied to the Hamiltonian (2.2.11), and the inclusion of 
the coupling to the scalar potential, we get the Pauli Hamiltonian: 


o-i)(o- 7) + q(x). (2.2.13) 


1 
Om | 


AP aul = 


This time, using the identity (2.2.9), the second term survives 


a = Cees eee (2.2.14) 


We have a x 7 # O because the various 7; do not commute. Note that the replacement 
(2.2.12) applied to the original Hamiltonian (2.2.8) would not have given us the 7 x 7 term. 
To evaluate that term we use 


(7 x T) k = €igkTiT; = Feigjk Ti, 75] * (2.2.15) 
The commutator here is 
qd qd 
[ ae eg = [pi—24:, pj - 24; < (2.2.16) 


As usual, the p components can be thought of as derivatives acting on the spatially de- 
pendent components of A. Moreover, the A;’s being only functions of position, commute 
among themselves and we have 


(ere = Ouaety= nm Clare (2.2.17) 
Therefore, back in (2.2.16) 
(wx Ww) = beaut (Q,A; — O;Ai) = Oey = ma (V x A), (2.2.18) 
leading to the elegant result: 
Aves Me (2.2.19) 
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This equation is a bit reminiscent of the equation L x L = ih, for angular momentum. 
Back in the Pauli Hamiltonian (2.2.14), leaving identity matrices implicit, and setting 
q = —e, we find 


P ; 
Pa —(p+£ A) pg = 6) 
2m c 2m c (2.2.20) 
1 (é e )’ eh B — 68(%) rn 
= — +- +—o -B- x). 
2m E Cc 2mc is 


The second term in this expanded Pauli Hamiltonian gives the coupling of the electron spin 
to the magnetic field and agrees precisely with the expected coupling (2.2.5). We thus see 
that the Pauli equation predicts the g = 2 value in the electron magnetic moment. 


2.3. The Dirac equation 


While the Pauli equation incorporates correctly the coupling of the electron spin to electro- 
magnetic fields, it is not a relativistic equation. As discovered by Dirac, to include relativity 
one has to work with matrices and the Pauli spinor must be upgraded to a four-component 
spinor. The analysis begins with the familiar relation between relativistic energies and 
momenta 

Be -pe=amMAaé + E=V/pe+md (2.3.1) 


This suggests that a relativistic Hamiltonian for a free particle could take the form 
A= V/p?? +m, (2.3.2) 


with associated Schrodinger equation 


ince = /p??+mcty. (2.3.3) 


It is not clear how to treat the square root so, at least for small velocities p << mc, the 
Hamiltonian can be expanded: 


~ 9 >) 1 a) 2 
A me 14 By =me |i E -3(4 ) +... 


(2.3.4) 


If we ignore the constant rest mass, the first term is the familiar non-relativistic Hamiltonian, 
and the next term is the first nontrivial relativistic correction. For small momenta we will 
treat that term as a perturbation. 

More elegantly, Dirac wanted to find a Hamiltonian linear in momenta and without 
square roots. This would be possible if one could write the relativistic energy as the square 
of a linear function of the momentum: 


op? + mc! = (ca p+ Bmce’)? = (carp; + carp, + cagp3 + Bmce’)? . (2.3.5) 
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Expanding the right-hand side and equating coefficients one finds that the following must 
hold 


aj = 03 = a3 = 6? = 1, 
4A; + AQAA = {aj;, aj} =0, 2 # 75 (2.3.6) 
a8 + Ba; = {ai,B} =0. 


The relations on the second and third lines imply that a@’s and 6’s can’t be numbers, because 
they would have to be zero. It turns out that a@’s and §’s are four-by-four hermitian matrices: 


as & ) , B= é 1D O37) 


Using (2.3.5), the Dirac Hamiltonian is simply the linear function of momentum that is the 
square root of c?p? + m?c*. We thus have 


Hpirac = Ca- pt Bmce?. (2.3.8) 
The Dirac equation is 
ow 
ha = (ca ‘pt Bmce”) WV, (2.3.9) 


where W is a Dirac spinor, a four-component column vector that can be thought to be 
composed by two two-component Pauli spinors y and 7: 


r(x Ga) = Ge) — 


The coupling to electromagnetic fields is done as before 


ince = om (p + <A) + Bm? + V(r) jy. (2.3.11) 


where the coupling of the electron to the scalar potential ®(r) is included via 


e2 


V(r) = —e®(r) = ——. (2.3.12) 
r 
The great advantage of the Dirac equation (2.3.11) is that the corrections to the hydrogen 
Hamiltonian H) can be derived systematically by finding the appropriate Hamiltonian H 
that acts on the Pauli spinor y. The analysis, can be done with A = 0, since the stationary 
proton creates no vector potential. The result of the analysis shows that 


Aya hy. (2.3.13) 


where: 


1 1 
Woy 


Im2c r dr 


6 H spin-orbit 
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The first correction is the relativistic energy correction anticipated earlier. The second is 
the spin-orbit coupling, and the third is the Darwin correction, that as we shall see affects 
only ¢ = 0 states. 


Recall that the energy scale for H) eigenstates is a2mc?. We will now see that all the 
above energy corrections are of order a*mc? thus smaller by a factor of a? ~ su than the 
zeroth-order energies. This suggests that for the hydrogen atom, the role of the unit-free 
parameter of perturbation theory is taken by the fine structure constant: \ ~ a’. Of 


course, in reality we cannot adjust the value of a? nor we can take it to zero. 


For the relativistic correction, recalling that p ~ amc, we indeed have 


4 
p 
OAyel, = =a —a‘me’. (2.3.15) 
For spin-orbit we first rewrite the term, using 
1dV_i1d e? hee (2.3.16) 
rdr  rdr rj) rs? 7 
so that 
e * 11 
0 Aspin-orbit = pe ree) -L. (2.3.17) 


For an estimate we set S-L ~ h?, r ~ ao, and recall that ag = ai: 


2 +2 2 
e Ah ahe h h\3 9 De 
6 A gpin-orbit ~ Wee ae = lee ae = (—) c = ame. (2.3.18) 
We can evaluate the Darwin term using V = —e?/r: 
e2h2 ‘ 1 e2h2 T e2h2 
b6Hparwin = Gee ¥ (=) = Bizen (475) = 9 mice (xr). (2.3.19) 


To estimate this correction note that, due to the 6 function the the integral in the expecta- 
tion value will introduce a factor |~(0)|? ~ aj°. We will therefore have 


e2h? 
3 


meas ~ a’ me’, (2.3.20) 


0 Aparwin tars 
as this is exactly the same combination of constants that we had for spin orbit above. 


2.4 Fine structure of hydrogen 


The fine structure of hydrogen is the spectrum of the atom once one takes into account the 
corrections indicated in (2.3.14). After the partial simplifications considered above we have 


) 4 2 272 
p p e S§-L nm eth 
A= —4+V- — —. —— ——._— 9 ; 2.4.21 
2m Pa 8m3c2 Imre r 2 mec? te. ( ) 
—— —— 


H() OAyel. OH spin-orbit 6HDarwin 
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We will study each of these terms separately and then combine our results to give the fine 
structure of hydrogen. There are further smaller corrections that we will not examine here, 
such as hyperfine splitting and Lamb effect. 


2.4.1 Darwin correction 


Let us now evaluate the Darwin correction. Since this interaction has a delta function 
at the origin, the first order correction to the energy vanishes unless the wavefunction is 
non-zero at the origin. This can only happen for nS' states. There is no need to use the 
apparatus of degenerate perturbation theory. Indeed, for fixed n there are two orthogonal 
£ = 0 states, one with electron spin up and one with electron spin down. While these states 
are degenerate, the Darwin perturbation commutes with spin and is therefore diagonal in 
the two-dimensional subspace. There is no need to include the spin in the calculation and 


we have 242 
1 Te 
ENOoparwin = (Pn0015HDarwinlPno0) = 5—y-5l¥n00(0)|?- (2.4.22) 
As is shown in the homework, the radial equation can be used to determine the value of the 
nS wavefunctions at the origin. You will find that 


1 


\Ynoo(0)|? = —s3. (2.4.23) 
ana 
As a result 
(1) _ €& 1 ana 9, 1 
T, o0- arcs a Im22 apn? =a (mc \53 . (2.4.24) 


This completes the evaluation of the Darwin correction 


The Darwin term in the Hamiltonian arises from the elimination of one of the two two- 
component spinors in the Dirac equation. As we will show now such a correction would arise 
from a nonlocal correction to the potential energy term. It is as if the electron had grown 
from point-like to a ball with radius of order its Compton wavelength oe The potential 
energy due to the field of the proton must then be calculated by integrating the varying 
electric potential over the charge distribution of the electron. While a simple estimate of 
this nonlocal potential energy does reproduce the Darwin correction rather closely, one must 
not reach the conclusion that the electron is no longer a point particle. Still the fact remains 
that in a relativistic treatment of an electron, its Compton wavelength is relevant and is 
physically the shortest distance an electron can be localized. 


The potential energy V(r) of the electron, as a point particle, is the product of the 
electron charge (—e) times the electric potential ®(r) created by the proton: 


V(r) = (—e)®(r) = (—e) =. (2.4.25) 


Let us call V(r) the potential energy when the electron is a charge distribution centered 
at a point r with |r| = r (see Figure 2.1). This energy is obtained by integration over the 
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Figure 2.1: A Darwin type correction to the energy arises if the electron charge is smeared over a 
region of size comparable to its Compton wavelength. Here the center of the spherically symmetric 
electron cloud is at P and the proton is at the origin. The vector u is radial relative to the center 
of the electron. 


electron distribution. Using the vector u to define position relative to the center P of the 
electron, and letting p(u) denote the position dependent charge density, we have 


V(r) = | du p(u)®(r + u), (2.4.26) 
electron 

where, as shown in the Figure, r+ u is the position of the integration point, measured 

relative to the proton at the origin. It is convenient to write the charge density in terms of 

a normalized function po: 


p(u) = —epo(u) > du po(u) =1, (2.4.27) 
electron 


which guarantees that the integral of p over the electron is indeed (—e). Recalling that 
—e®(r + u) = V(r+u) we now rewrite (2.4.26) as 


Vir) = du po(u)V(r +). (2.4.28) 
electron 

This equation has a clear interpretation: the potential energy is obtained as a weighted 

integral of potential due to the proton over the extended electron. If the electron charge 

would be perfectly localized, pp(u) = 6(u) and V(r) would just be equal to V(r). We will 

assume that the distribution of charge is spherically symmetric, so that 


po (wu) = po(u). (2.4.29) 
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To evaluate (2.4.31), we first do a Taylor expansion of the potential that enters the integral 
about the point u = 0: 


Vrtu)=V() + aV| wt 5 8:0jV| ie Hiss (2.4.30) 
i ij 


All derivatives here are evaluated at the center of the electron. Plugging back into the 
integral (2.4.31) and dropping the subscript ‘electron’ we have 


Vee / au po(u)(V (®) + 2 AV| ae + 5 32 8:;V| wen Pevle (HS) 
; ij 


All r dependent functions can be taken out of the integrals. Recalling that the integral of 
po over volume is one, we get 


V(r) = V(r) + SS av | [eu po (u) pi + 4 s d:0;V| [eu po (u) ujzuy+... (2.4.32) 
i 5 ij z 
Due to spherical symmetry the first integral vanishes and the second takes the form 


[eu PO (u) UjU; = 40ij [en PO (u) u . (2.4.33) 


Indeed the integral must vanish for i 4 7 and must take equal values for i = j = 1,2,3. 
Since u? = u} + u3 + uf, the result follows. Using this we get 


V(r) = V(r) + +>29:0:V| £ | @usca)p® +... 
: (2.4.34) 
= V(r) + bv f du po (a) w? ee 
The second term represents the correction 6V to the potential energy: 
Ov = iwy [du po (u) u?. (2.4.35) 


To get an estimate, let us assume that the charge is distributed uniformly over a sphere of 
radius up. This means that po(u) is a constant for u < ug 


3 1, uU< Uo, 
po(u) = ae (2.4.36) 
m™U9 | 0, u> ug. 
The integral one must evaluate then gives 
uo 4 2d 2 3 Uo 
[bootup = | —", = 3 udu = Sup. (2.4.37) 
0 3 UO Uo JO 
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Therefore, 
6V = uv’V. (2.4.38) 


If we choose the radius ug of the charge distribution to be the Compton wavelength a of 
the electron we get, 
h2 
10 m2c? 
Comparing with (2.3.14) we see that, up to a small correction (3 as opposed to an): this 
is the Darwin energy shift. The agreement is surprisingly good for what is, admittedly, a 
heuristic argument. 


6V = VV. (2.4.39) 


2.4.2 Relativistic correction 


We now turn to the relativistic correction. The energy shifts of the hydrogen states can 
be analyzed among the degenerate states with principal quantum number n. We write 
tentatively for the corrections 
(1) 1 
Le perce = er) (Wnempm,|P” p *lDneme) : (2.4.40) 

We can use this formula because the uncoupled basis of states at fixed n is good: the 
perturbation is diagonal in this basis. This is checked using Remark 3 in section 1.2.2. This 
is clear because the perturbing operator p?p? commutes with L?, with L,, and with Sy. 
The first operator guarantees that that the matrix for the perturbation is diagonal in @, the 
second guarantees that the perturbation is diagonal in m,, and the third guarantees, rather 
trivially, that the perturbation is diagonal in m,. 

To evaluate the matrix element we use the Hermiticity of p? to move one of the factors 
into the bra 


1 1 
EX ee — 35 (P’Untm|P'Ynem) , (2.4.41) 


where in the right-hand side we evaluated the trivial expectation value for the spin degrees 
of freedom. To simplify the evaluation we use the Schrodinger equation, which tells us that 


2 
(F+V)Unem = EO bvtm Prt = 2m(EO) — Vat (2.4.42) 


Using this both for the bra and the ket: 


1 
Sy een = ~ Ime ( (EO) =< Vbntm| (EL? = V)bnem ) . (2.4.43) 


(0) 


The operator E,,’ — V is also Hermitian and can be moved from the bra to the ket, giving 


1 
EO nema = yee (Yntm|( (HL)? — 2V EO + V*)| atm) 
1 (2.4.44) 
= T Sines (zo)? — 2Ep(V)ntm + (V?)ntm| . 
me 
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The problem has been reduced to the computation of the expectation value of V(r) and 
V7(r) in the Unem state. The expectation value of V(r) is obtained from the virial theorem 
that states that (V) = 26). For V(r) we have 


1 1 e? 1 a 4n 4n 
(v2) = (5) — “aah = (<<) = (BE). : (2.4.45) 
0 2 


Back into (2.4.44) we find 


EY) = — 3; = B18 (2.4.46) 


n,lmems;rel 


The complete degeneracy of @ multiplets for a given n has been broken. That degeneracy 
of H) was explained by the conserved Runge-Lenz vector. It is clear that the relativistic 
correction has broken that symmetry. 

We have computed the above correction using the uncoupled basis 


1) ae = (nimi, OH lining) =F Ot, 0) (2.4.47) 
Here we added the extra equality to emphasize that the matrix elements depend only on n 
and @. We have already seen that in the full degenerate subspace with principal quantum 
number n the matrix for 6H;., is diagonal in the uncoupled basis. But now we see that in 
each degenerate subspace of fixed n and £, 6H;¢ is in fact a multiple of the identity matrix, 
since the matrix elements are independent of m and m, (the LZ, and S$) eigenvalues. A 
matrix equal to a multiple of the identity is invariant under any orthonormal change of 
basis. For any (@} multiplet, the resulting 7 multiplets provide an alternative orthonormal 
basis. The invariance of a matrix proportional to the identity implies that 


Hee ca = (nljm;|OHyeal|nejm;) = f(n,£). (2.4.48) 
with the same function f(n, @) as in (2.4.47), and the perturbation is diagonal in this coupled 
basis too. This is clear anyway because the perturbation commutes with L?, J? and J, and 
and any two degenerate states in the coupled basis differ either in @, 7 or jz. 

The preservation of the matrix elements can also be argued more explicitly. Indeed, 
any state in the coupled basis is a superposition of orthonormal uncoupled basis states with 
constant coefficients c;: 


|néjm;) = S| cilnémim’) , with ye ds (2.4.49) 


} 


because the state on the left-hand side must also have unit norm. Therefore, using the 
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diagonal nature of the matrix elements in the uncoupled basis we get, as claimed 


(néjm;|b Are |\nejm,;) SS cc, (nemim*|6 Hrei|nemem*) 
i,k 


S- lei? (némpm’|5 Hreilnempm') (2.4.50) 


= DilePF(m,e) = Fao. 


2.4.3 Spin orbit coupling 


The spin-orbit contribution to the Hamiltonian is 


| 


—~~ —§- 2.4.51 
2m2c?2 r3 ( ) 


OH, spin-orbit = 


Note that dA spin-orbit commutes with L? because L? commutes with any im and any Si. 
Moreover, dAgpin-orbit Commutes with J? and with J, since, in fact, [ee S-L] = 0 for 
any 7; S-L is a scalar operator for J. It follows that dA spin-orbit is diagonal in the level 
n degenerate subspace in the coupled basis |n@jm,;). In fact, as we will see, the matrix 
elements are m-independent. This is a nontrivial consequence of 6H spin-orbit being a scalar 
under J. To compute the matrix elements we recall that J=S+L and 


e” 22.4 
Bo de et, = Im2c2 (nejmj| 5S : L|nnéjm;) 
Co FEY ns 3 1 
= Sede [7G +1) - +1) 3 (mejmg|—|n€jm,;). (2.4.52) 


We need the expectation value of 1/r? in these states. It is known that 


1 


1 
nebime| = |néme) = ————_____—_.. 2.4.53 
( rs ndagl (C+ 4) (+1) ( ) 
Because of the m¢ independence of this expectation value (and its obvious ms independence) 
the operator 1/r® is a multiple of the identity matrix in each 0@ $ multiplet. It follows that 


it is the same multiple of the identity in the coupled basis description. Therefore 


1 1 
}n 5 | mn; ) = — ~——__———_. 2.4.54 
(ntimi] Inti) ndape (¢+ 4) (+1) Cee 
Using this in (2.4.52) 
me [GG +1) - +1) — 3) (2.4.55) 
néjm,; spin-orbit 4m?2c2 nape (¢ + 1) (¢ ae 1) 
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) 


Working out the constants in terms of Ee and rest energies we get 


(0)\2 7 [ara 3 

ae in-orbit — es ) 7 iG 2 z 1 at a = i ’ U a 0. (2.4.56) 
J™m3j3SP mc £(€+5) (+1) 
Since L vanishes identically acting on any @ = 0 state, it is physically reasonable, as we will 
do, to assume that the spin-orbit correction vanishes for € = 0 states. On the other hand 
the limit of the above formula as 2 > 0, while somewhat ambiguous, is nonzero. We set 
jg=lt $ (the other possibility 7 = @ — 5 does not apply for = 0) and then take the limit 
as £ — 0. Indeed, 


mL, CP a [+ gE + 3) = E+ =F 
mlamsisoicotitlsaeng mee A) +1) easn 
02 : A. 
me (¢+4)(€+1)’ 
and now taking the limit: 
(0))2 
: (1) _ (En’) ee ee. 
lim Eira cesinoaet ea ae (2n) = a* me ore (2.4.58) 


We see that this limit is in fact identical to the Darwin shift (2.4.24) of the nS states. This 
is a bit surprising and will play a technical role below. 
2.4.4 Combining results 


For € # 0 states we can add the energy shifts from spin-orbit and from the relativistic 
correction, both of them expressed as expectation values in the coupled basis. The result, 
therefore will give the shifts of the coupled states. Collecting our results (2.4.46) and (2.4.56) 
we have 


(nejmj) Hye + OAspin-orbit nbjmy) 
(Eo? {2- dn Qn [{(F +1) — e+) - 3] 
( + 


a +3) (lg) (OD) (2.4.59) 
Gy jG +1) — 3641) -3 


These are the fine structure energy shifts for all states in the spectrum of hydrogen. The 
states in a coupled multiplet are characterized by ¢, 7 and m; and each multiplet as a whole 
is shifted according to the above formula. The degeneracy within the multiplet is unbroken 
because the formula has no m; dependence. This formula, as written, hides some additional 
degeneracies. We uncover those next. 
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In the above formula there are two cases to consider for any fixed value of j: the multiplet 
can have ¢ = j — 5 or the multiplet can have @= 7 + 5. We will now see something rather 
surprising. In both of these cases the shift is the same, meaning that the shift is in fact @ 
independent! It just depends on j. Call f(j, 2) the term in brackets above 


} jj +1) — 3€(€+1) -3 
) = 2.4.60 
G8 (+4) (+1 Cae 
The evaluation of this expression in both cases gives the same result: 
40,0, = SOTA EBS Soa, ATT a ae 
News G-DiG+h | ~7G-DG+N Grd 
50.0) 2 FOE SU a0 Pea. PP aap 8 8 8 
Ment” G+DGtDG+H)  — GFDG+DG+H GD 
(2.4.61) 


We can therefore replace in (2.4.59) the result of our evaluation, which we label as fine 
structure (fs) shifts: 


(0)\2 
(1) _ _(En’)* 7 4n os. ozs ed n 3 
nbj,myifs ~~ Ped j Ss ] 3 | = —a"(mc Peal : +i | : (2.4.62) 


More briefly we can write 


ae) = —atme 8,5, (2.4.63) 


ntj,m;;fine 


Let us consider a few remarks: 


1. The dependence on 7 and absence of dependence on @ in the energy shifts could be 
anticipated from the Dirac equation. The rotation generator that commutes with the 
Dirac Hamiltonian is J = L+S, which simultaneously rotates position, momenta, and 
spin states. Neither L nor S are separately conserved. With J a symmetry, states are 
expected to be labelled by energy and j and must be m; independent. 


2. The formula (2.4.63) works for nS states! For these ¢ = 0 states we were supposed 
to add the relativistic correction and the Darwin correction, since their spin-orbit 
correction is zero. But we noticed that the limit £— 0 of the spin-orbit correction 
reproduces the Darwin term. Whether or not this is a meaningful coincidence, it 
means the sum performed above gives the right answer for ¢ — 0. 


3. While a large amount of the degeneracy of H) has been broken, for fixed n, multiplets 
with the same value of 7, regardless of @, remain degenerate. The states in each j 
multiplet do not split. 
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(2.4.64) 


4. Since S;,; > 0 all energy shifts are down. Indeed 
n n n n n Gc 4 
ut a oS Pe OP eer ee 
J+ Jmax + 5 Emax + 3 +95 n ITS 
5. For a given fixed n, states with lower values of 7 get pushed further down. As n 


increases splittings fall off like n~°. 


A table of values of S;,,; is given here below 


Fine Structure Spectrum 
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For the record, the total energy of the hydrogen states is the zeroth contribution plus 
the fine structure contribution. Together they give 


2 


2 
e 1 ray n 
Eine; =-3-5|1 —(—-4)]- 2.4.65 
ntims “Dag LT ne Graf aie 


This is the fine structure of hydrogen! There are, of course, finer corrections. The so- 
called Lamb shift, for example, breaks the degeneracy between 2S} /2 and 2P,/2 and is of 
order a°. There is also hyperfine splitting, which arises from the coupling of the magnetic 
moment of the proton to the magnetic moment of the electron. Such coupling leads to a 
splitting that is a factor m_/m, smaller than fine structure. 


2.5 Zeeman effect 


In remarkable experiment done in 1896, the Dutch physicist Pieter Zeeman (1865-1943) 
discovered that atomic spectral lines are split in the presence of an external magnetic field. 
For this work Zeeman was awarded the Nobel Prize in 1902. The proper understanding of 
this phenomenon had to wait for Quantum Mechanics. 

The splitting of atomic energy levels by a constant, uniform, external magnetic field, the 
Zeeman effect, has been used as a tool to measure inaccessible magnetic fields. In observing 
the solar spectrum, a single atomic line, as seen from light emerging from outside a sunspot, 
splits into various lines inside the sunspot. We have learned that magnetic fields inside a 
sunspot typically reach 3,000 gauss. Sunspots are a bit darker and have lower temperature 
than the rest of the solar surface. They can last from hours to months, and their magnetic 
energy can turn into powerful solar flares. 

The external magnetic field interacts with the total magnetic moment of the electron. 
The electron has magnetic moment due to its orbital angular momentum and one due to 
its spin 

j= 2S ay. Gig S Ses, (2.5.1) 


where we included the g = 2 factor in the spin contribution. The Zeeman Hamiltonian is 
thus given by 

€ 
2me 
Conventionally, we align the magnetic field with the positive z axis so that B = Bz and 
thus get 


OH eéenan = —(pe ate Ls) ‘B= (L + 2S) -B. (2:52) 


6 Zeeman a ee (Lz + 25',) : (2.5.3) 


When we consider the Zeeman effect on Hydrogen we must not forget fine structure. 
The full Hamiltonian to be considered is 


H = H®) + 5H + 5H zeman - (2.5.4) 
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Recall that in fine structure, there is an internal magnetic field Bi, associated with spin- 
orbit coupling. This is the magnetic field seen by the electron as it goes around the proton. 
We have therefore two extreme possibilities concerning the external magnetic field B of the 
Zeeman effect: 


(1) Weak-field Zeeman effect: B << Bint. In this case the Zeeman effect is small compared 
with fine structure effects. Accordingly, the original Hamiltonian H) together with 
the fine structure Hamiltonian Hy, are thought as the “known” Hamiltonian H©), and 
the Zeeman Hamiltonian is the perturbation: 


H = H© + 6H +5Hzeeman - (2.5.5) 
—S_S 


FL(0) 


(2) Strong-field Zeeman effect: B >> Bint. In this case the Zeeman effect is much larger 
than fine structure effects. Accordingly, the original Hamiltonian H together with 
the Zeeman Hamiltonian are thought as the “known” Hamiltonian H) and the fine 
structure Hamiltonian H;, is viewed as the perturbation: 


H = H + 6H zceman +oH¢s - (2.5.6) 
— 


(0) 


You may thing that H) + 6Hzeeman does not qualify as known, but happily, as we 
will confirm soon, this is actually a very simple Hamiltonian. 


When the Zeeman magnetic field is neither weak nor strong, we must take the sum 
of the Zeeman and fine structure Hamiltonians as the perturbation. No simplification is 
possible and one must diagonalize the perturbation. 


Weak-field Zeeman effect. The approximate eigenstates of H) are the coupled states 
|néjm;) that exhibit fine structure corrections and whose energies are a function of n and J, 
as shown in the Fine Structure diagram. Degeneracies in this spectrum occur for different 
values of £ and different values of m,;. 
To figure out the effect of the Zeeman interaction on this spectrum we consider the 
matrix elements: 
(nljm5|5 Hzeeman|nljm',) : (25:7) 


Since dHzeeman ~ Lz + 2S we see that dA zceman commutes with L? and with ye: The 
matrix element thus vanishes unless ¢’ = ¢ and mi; =m, and the Zeeman perturbation is 
diagonal in the degenerate fine structure eigenspaces. The energy corrections are therefore 


(7) eh dhe i B sce ag 
ET iim _ oe (néjmj|(Lz + 252)|néjmj) =, (2.5.8) 


where we multiplied and divided by fh to make the units of the result manifest. The result 
of the evaluation of the matrix element will show a remarkable feature: a linear dependence 
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EO w~ hm; on the azimuthal quantum numbers. The states in each 7 multiplet split into 
equally separated energy levels! We will try to understand this result as a property of 
matrix elements of vector operators. First, however, note that Le +25, = i + Ss. and 
therefore the matrix element of interest in the above equation satisfies 


(néjm;|(Lz + 28.)|n€jm;) = im + (nejm,|Sz|\njm,;) . (2.5.9) 


It follows that we only need to concern ourselves with &, ', Imatrix elements. 

Let’s talk about vector operators. : The operator V is said to be a vector operator 
under an angular momentum operator J if the following commutator holds for all values of 
i,j =1,2,3: a 

[dest Vil Se egy: Vie (2.5.10) 
It follows from the familiar J commutators that J is a vector operator under J. Additionally, 
if V is a vector operator it has a standard commutation relation with J? that can be quickly 
confirmed: 
[J?,V] = 2n(V x J -inV). (2.5.11) 
If V is chosen to be J the left-hand side vanishes by the standard property of J? and the 
right-hand side vanishes because the J commutation relations can be written as Jx J = ind. 
Finally, by repeated use of the above identities you will show (homework) that the following 
formula holds 
1 RD. Re, Me Kee hoe Ae ue nents 
aa | PP Wl] = WI - $V EVI), 2.5.12 
onze V1 (VII - £(PV4+V02) (2.5.12) 
Consider (J?, J.) eigenstates |k; jm ;) where k stands for other quantum number that bear 
no relation to angular momentum. The matrix elements of the left-hand side of (2.5.12) on 
such eigenstates is necessarily zero: 


(k's imi Ba sae VI |k; jm;) = 0, (2.5.13) 


as can be seen by expanding the outer commutator and noticing that J? gives the same 
eigenvalue when acting on the bra and on the ket. Therefore the matrix elements of the 
right-hand side gives 


(k's imi |(V-S) Sky my) = jG +1) (KS gmi,[V ks jms), (2.5.14) 


which implies that 


(k!; jm!.\(V J) Jk; jms) 


BIGHT (2.5.15) 


This is the main identity we wanted to establish. Using the less explicit notation (---) for 
the matrix elements we have found that 


yaaa a i (2.5.16) 
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This is sometimes called the projection lemma: the matrix elements of a vector operator Vv 
are those of the conventional projection of V onto J. Recall that the projection of a vector 
v along the vector j is (v - j)j/j’. 

Let us now return to the question of interest; the computation of the the expectation 
value of S, in (2.5.20). Since S is a vector operator under J we can use (2.5.15). Specializing 
to the z-component 


hm; (néjm,|S- J nb jj) 
h? j(j +1) 


We already see the appearance of the predicted hm, factor. The matrix element in the 


(ne jm;|Sz|né jm;) = (2.5217) 


numerator is still to be calculated but it will introduce no m; dependence. In fact S-Jisa 
scalar operator (it commutes with all Ji ) and therefore it is diagonal in m;. But even more 
is true; the expectation value of a scalar operator is in fact independent of my! We will not 
show this here, but will just confirm it by direct computation. Since L=J-—S we have 


S-J = 4(7°+8’-L?’), (2.5.18) 
and therefore 


hm; 


(ne jmj|Sz|n€jm;) = CCR Cs ue ms 1) +). (2.5.19) 


Indeed, no further m; dependence has appeared. Back now to (2.5.20) we get 


ee ae eee jG +1) -eé+1) +4 

Ljm,|(Lz + 2S,)|nljm;) = hm, 1 + —————_—— } . 2.5.20 
(negjmj|( )Jmejrns) = is TEES eee (2.5.20) 

The constant of proportionality in parenthesis is called the Lande g-factor g,(¢): 

jG +1) -e€+1)+3 
(= 2.5;21 
@) 2j(j + 1) 
We finally have for the Zeeman energy shifts in (2.5.8) 
1 eh 

Bi = 5 B gst) mj (2.5.22) 


Here the Bohr magneton th ~ 5.79 x 10-°eV/gauss. This is our final result for the weak- 
field Zeeman energy corrections to the fine structure energy levels. Since all degeneracies 
within 7 multiplets are broken and j multiplets with different ¢ split differently due to the 
é dependence of g7(¢), the weak-field Zeeman effect removes all degeneracies! 


Strong-field Zeeman effect. We mentioned earlier that when the Zeeman effect is larger 
than the fine structure corrections we must take the original hydrogen Hamiltonian together 
with the Zeeman Hamiltonian to form the ‘known’ Hamiltonian H©: 


HO — p#O+4_~ (£,428,)B. (2.5.23) 
2mc 
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Actually H© is simple because the Zeeman Hamiltonian commutes with the zero-th order 
hydrogen Hamiltonian 

[i,+28,,H] = 0. (2.5.24) 
We can thus find eigenstates of both simultaneously. Those are in fact the uncoupled basis 
states! We have 


HOlnémems) = EO |\némems) 


ees (2.5.25) 
(Lz +2S8,)|\nlmems) = hime + 2m,)|nLmems) . 


and therefore the uncoupled basis states are the exact energy eigenstates of H (9) and have 
energies 


eh 
Entmyms = EY) + — B(m, + 2m). (2.5.26) 
2mc 
Some of the degeneracy of H) has been removed, but some remains. For a fixed principal 


quantum number n there are degeneracies among ¢ ® $ states and degeneracies among such 
multiplets with @ and ¢’ different. This is illustrated in Figure 2.2. 


Figure 2.2: Illustrating the degeneracies remaining for 0 = 0 and @ = 1 after the inclusion of Zeeman 
term in the Hamiltonian. Accounting for the spin of the electron there two degenerate states in the 
£=1 multiplet 1@ 4 and each of the two states in the @ = 0 multiplet 0 ® 4 is degenerate with a 
state in the = 1 multiplet. 


The problem now is to compute the corrections due to 6H; on the non-degenerate and 
on the degenerate subspaces of H). The non-degenerate cases are straightforward, but 
the degenerate cases could involve diagonalization. We must therefore consider the matrix 


elements 
nl'mpm', Oy. |nLmems) , 2.5.27 
L 


with the condition 
my + 2m’, = met 2mz, (2.5.28) 
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needed for the two states in the matrix element to belong to a degenerate subspace. Since 
L? commutes with 5H;; the matrix elements vanish unless ¢ = @’ and therefore it suffices to 
consider the matrix elements 


(n'mipm',| 6H¢s|n€mems) , (2.5.29) 


still with condition (2.5.28). Ignoring ¢ = 0 states, we have to re-examine the relativistic 
correction and spin orbit. The relativistic correction was computed in the uncoupled basis 
and one can use the result because the states are unchanged and the perturbation was 
shown to be diagonal in this basis. For spin-orbit the calculation was done in the coupled 
basis because spin-orbit is not diagonal in the original H) degenerate spaces using the 
uncoupled basis. But happily, it turns out that spin-orbit is diagonal in the more limited 
degenerate subspaces obtained after the Zeeman effect is included. All these matters are 
explored in the homework. 


Chapter 3 


Semiclassical approximation 


© B. Zwiebach 


3.1 The classical limit 


The WKB approximation provides approximate solutions for linear differential equations 
with coefficients that have slow spatial variation. The acronym WKB stands for Wentzel, 
Kramers, Brillouin, who independently discovered it in 1926. It was in fact discovered 
earlier, in 1923 by the mathematician Jeffreys. When applied to quantum mechanics, it is 
called the semi-classical approximation, since classical physics then illuminates the main 
features of the quantum wavefunction. 

The de Broglie wavelength 4 of a particle can help us assess if classical physics is relevant 
to the physical situation. For a particle with momentum p we have 


h 
eee Sula 
5 (3.1.1) 


Classical physics provides useful physical insight when » is much smaller than the relevant 
length scale in the problem we are investigating. Alternatively if we take h > 0 this will 
formally make \ —> 0, or A smaller than the length scale of the problem. Being a constant of 
nature, we cannot really make h — 0, so taking this limit is a thought experiment in which 
we imagine worlds in which h takes smaller and smaller values making classical physics more 
and more applicable. The semi-classical approximation studied here will be applicable if 
a suitable generalization of the de Broglie wavelength discussed below is small and slowly 
varying. The semi-classical approximation will be set up mathematically by thinking of h 
as a parameter that can be taken to be as small as desired. 


Our discussion in this chapter will focus on one-dimensional problems. Consider there- 
fore a particle of mass m and total energy EF moving in a potential V(x). In classical physics 
E—V‘(z) is the kinetic energy of the particle at x. This kinetic energy depends on position. 
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Since kinetic energy is ee this suggests the definition of the local momentum p(z): 


p(x) = 2m(E-V(z)). (3.1.2) 


The local momentum p(z) is the momentum of the classical particle when it is located at 
x. With a notion of local momentum, we can define a local de Broglie wavelength (x) 
by the familiar relation: 


h 2rh 
N(x) = 1 = m(a) : (3.1.3) 
The time-independent Schrédinger equation 
pao 
om age Ve) = (B-V(2))v(@). (3.1.4) 


—h—y = p*(x)p. (3.1.5) 


Using the momentum operator, this equation takes the suggestive form 


BP w(x) = p(x) (2). (3.1.6) 


This has the flavor of an eigenvalue equation but it is not one: the action of the momentum 
operator squared on the wavefunction is not really proportional to the wavefunction, it is 
the wavefunction multiplied by the ‘classical’ position-dependent momentum-squared. 


A bit of extra notation is useful. If we are in the classically allowed region, FE > V(x) 
and p(x) is positive. We then write 


p(x) = 2m(E—V(ax)) = hk?(2), (3.1.7) 


introducing the local, real wavenumber k(a). If we are in the classically forbidden region, 
V(x) > E and p?(z) is negative. We then write 


—p*(x) = 2m(V(x£) — E) = f'n? (x), (3.1.8) 
introducing the local, real K(x). 
The wavefunctions we use in the WKB approximation are often expressed in polar form. 


Just like any complex number z can be written as re’’, where r and @ are the magnitude 
and phase of z, respectively, we can write the wavefunction in a similar way: 


ED. oa) exp (7S(x. ‘)). (3.1.9) 
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We are using here three-dimensional notation for generality. By definition, the functions 
p(x,t) and S(x,t) are real. The function p is non-negative and the function S(x,t) as 
written, has units of h. The name p(x, t) is well motivated, for it is in fact the probability 
density: 

p(x,t) = |W(x,t)|?. (3.1.10) 


Let’s compute the probability current. For this we begin by taking the gradient of the 
wavefunction 


VU = -—e® + _VSU (3.1.11) 


We then form: 


1 ; 
WV = 5Vp+ = pvs (3.1.12) 
The current is given by 
h 
J=—Im(V*vy). (3.1213) 
m 
It follows that 
Tag (3.1.14) 
m 


This formula implies that the probability current J is perpendicular to the surfaces of 
constant S, the surfaces of constant phase in the wavefunction. 

In classical physics a fluid with density p(x) moving with velocity v(x) has a current 
density pv = pe. Comparing with the above expression for the quantum probability 
current, we deduce that 

p(x) ~ VS. (3.1.15) 


We use the ~ because this is an association that is not of general validity. Nevertheless, 
for ‘basic’ WKB solutions we will see that the gradient of S is indeed the classical local 
momentum! This association also holds for a free particle: 


Example. Consider a free particle with momentum p and energy E. Its wavefunction is 


(3.1.16) 


i) i) 


feat Bt 
¥(x,t) = exp | ze ote |; 


Here we identify the function S as S = p- x — Et and therefore VS = p. In this case VS 
is equal to the momentum eigenvalue, a constant. 


3.2. WKB approximation scheme 


Our aim here is to find approximate solutions for the wavefunction ~(x) that solves the 
time independent Schrdinger equation in one dimension. We wrote before the polar decom- 
position (3.1.9) of the wavefunction. It will be easier to set the approximation scheme by 
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using a single complex function S(x) to represent the time-independent wavefunction w(z). 
For this we use a pure exponential without any prefactor: 


v(x) = exp(+5()) , S(e) EC. (3:2:1) 


As before S must have units of h. This S(x) here is a complex number, because wavefunc- 

tions are not in general pure phases. The real part of S, divided by h, is the phase of the 

wavefunction. The imaginary part of S(a) determines the magnitude of the wavefunction. 
Let us plug this into the Schrédinger equation (3.1.5): 


2 ; . 
ee (cis) = p'(x)ekS@ | (3.2.2) 
XL 


Let us examine the left-hand side and take the two derivatives 


2 


d a d (i a icles (sig aa Ve 
FO -S(v)\ _ op 2“ f & of 5 (x) _ 2 S(a) 
h 7 (ct ) =—-h Tn (5s (x)er ) =—-h ( = 7p Je ; (3.2.3) 


Back into the differential equation and canceling the common exponential 


he (= a) Oe (3.2.4) 


A h? 


With minor rearrangements we get our final form: 


(S'(x))? — iS" (x) = p?(x). (3:25) 


The presence of an explicit iin the equation tells us that the solution for S, as expected, 
cannot be real. At first sight one may be baffled: we started with the linear Schrédinger 
equation for w and obtained a nonlinear equation for S$! This is actually unavoidable when 
the variable w is parameterized as an exponential and reminds us that sometimes a change 
of variables can turn a linear equation into a nonlinear one and viceversa. The nonlinear 
equation (3.2.5) allows us set up an approximation scheme in which h is considered small 
and thus the term involving S”(2) is small. We will argue that this is in fact true for slowly 
varying potentials. 
Claim: ihS” small if V(x) is slowly varying. 
Indeed, if V(a) = Vo is a constant, then the local momentum p(x) is equal to a constant po. 
Equation (3.2.5) is then solved by taking S’ = po. For this choice $” = 0 and the term ihS” 
vanishes identically for constant V. It should therefore be small for slowly varying V(x). 
Alternatively the term iiS” in the differential equation is small as h + 0, which makes 
the local de Broglie wavelength go to zero. In that case, the potential looks constant to the 
quantum particle. 
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We will thus take fh to be the small parameter in a systematic expansion of S(): 
S(x) = So(x) + AS1(x) + h?.So(x) + O(h?). (3.2.6) 


Here So, just like S, has units of A. The next correction S$; has no units, and the following, 
S has units of one over fi. Now plug this expansion into our nonlinear equation (3.2.5) 


(5), HAS ASE...) — ih (SE EAST EASE...) = p?@)i = 0: 82.7) 


The left-hand side is a power series expansion in h. Just as we argued for the parameter A 
in perturbation theory, here we want the left-hand side to vanish for all values of A and this 
requires that the coefficient of each power of f vanishes. We thus sort the left-hand side 
terms neglecting terms of order h? or higher. We find 


(So)? —p? + &(28)S1 —i8f) +O (A) =0 (3.2.8) 
This gives two equations, one for the coefficient of (h)° and another for the coefficient of hi: 


(So)? — p*(x) = 0, 


mae (3.2.9) 
2595} = 1S — 0. 
The first equation is easily solved: 
Sh=+p(z) > Sp(xr) = at p(a" dz (3.2.10) 
Xo 


where Zo is a constant of integration to be adjusted. The next equation allows us to find 
S$, which is in fact imaginary: 


, 289 _ i(tp'(z)) _ ip’ 
S=5 Ge oe (3.2.11) 


This is readily solved to give 
iSy(x) = —5ln p(x) +C’. (3.2.12) 
Let us now reconstruct the wavefunction to this order of approximation: 


eee (So shige o1n?)) bedi | 50| Kash (3.2.13) 


Using our results for Sg and S$; we have that the approximate solution is 


w(x) = exp [ee f pteae' exp [—4 logp(xz) + C’] . (3.2.14) 
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We thus have 


w(x) = ae exp lg [neo (3.2:15) 


This is the basic solution in the WKB approximation. We do not attempt to normalize this 
wavefunction because, in fact, the region of validity of this approximation is still unclear. 


Observables for the basic solution: 


i) Probability density: 
JA? _ IAP 


p=vy= a ae (3.2.16) 


where v(x) is the local classical velocity. Note that p is higher where v is small as 
the particle lingers in those regions and is more likely to be found there. This is an 
intuition we developed long ago and is justified by this result. 


il 


SS 


Probability current: In the language of the polar decomposition (3.1.9) of the wave- 
function the basic solution corresponds to 


S{e) = [ vleae’ (3.2.17) 


Note that, as anticipated below equation (3.1.15) the gradient of S in the basic solution 
is the local momentum. Recalling the result (3.1.14) for the current, 


J = p——, (3.2.18) 


and therefore, 
_ AP pla) _ [AP 
p(x) m m 


J (i) 


(3.2.19) 


The fact that the current is a constant should not have taken us by surprise. A 
position-dependent current for an energy eigenstate is not possible, as it would violate 
the current conservation equation 0,J(x) + O:p = 0, given that p is time independent. 


We con now write the general solutions that can be built from the basic solution and 
apply to classically allowed and classically forbidden regions. 


On the allowed region E — V(x) > 0 we write p?(x) = h?k?(x), with k(x) > 0 and hence the 
general solution is a superposition of two basic solutions with waves propagating in opposite 
directions: 


w(x) = Re? fe [eee = exp | if wetyae| (3.2.20) 
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Note that the first wave, with coefficient A moves to the right while the second wave, 
with coefficient B moves to the left. This can be seen by recalling that the above energy 
eigenstate (x) is accompanied by the time factor e'Et/h when forming a full solution of 
the Schrédinger equation. Moreover, the first phase (accompanying A) grows as x grows, 
while the second phase becomes more negative as x grows. 


On the forbidden region p? = —h?«?(x) so we can take p(x) = ix(x) in the solution, 
with « > 0, to find 


N= aa | [ “(at + ees |- i, “(a a (3.2.21) 


The argument of the first exponential becomes more positive as x grows. Thus the first 
term, with coefficient C,, is an increasing function as x grows. The argument of the second 
exponential becomes more negative x grows. Thus the second term, with coefficient D, is 
a decreasing function as x grows. 


3.2.1 Validity of the approximation 


To understand more concretely the validity of our approximations, we reconsider the ex- 
pansion (3.2.8) of the differential equation: 


(So)? —p? + B(28)S) — i188) + O (H) =0 (3.2.22) 


We must have that the O (f#) terms in the differential equation are much smaller, in magni- 
tude, than the O (1) terms. At each of these orders we have two terms that are set equal to 
each other by the differential equations. It therefore suffices to check that one of the O (h) 
terms is much smaller than one of the © (1) terms. Thus, for example, we must have 


lASOSi| < |S6l?. (3.2.23) 
Canceling one factor of |S6| and recalling that |S4| = |p| we have 
|nSi| < |p|. (3.2.24) 


From (3.2.11) we note that |S{| ~ |p'/p| and therefore we get 
p 
om < |pl. (3.2.25) 
Pp 
There are two useful ways to think about this relation. First we write it as 
h\\| dp dp 
= lie + ale 3.2.26 
slkael < =| < lel (3.2.26) 


which tells us that the changes in the local momentum over a distance equal to the de 
Broglie wavelength are small compared to the momentum. Alternatively we write (3.2.25) 
as follows: 


/ 
Pp 
nS 


2a: is a Ze) (3.2.27) 
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This now means 


[S| ae (3.2.28) 


The de Broglie wavelength must vary slowly. Note the consistency with units, the left-hand 
side of the inequality being unit free. More intuitive, perhaps, is the version obtained by 
multiplying the above by A 


pl ae (3.2.29) 
dx 


| dX 
This tells us that the variation of the de Broglie wavelength 4 over a distance A must be 
much smaller that 4. 

It is not hard to figure out what the above constraints tell us about the rate of change 
of the potential. Finally connect to the potential. Taking one spatial derivative of the 


equation p? = 2m(E — V(a)) we get 


dV dV 1 
pp] = my = —|pp|. (3.2.30) 
x dx m 
Multiplying by the absolute value of \ = h/p 
dV 27h py 
r = i 32.31 
(oF |=—pl«=, (3.2.31) 


where the last inequality follows from (3.2.25). Hence, we find that 


dV p 
— —., 2.32 
dx . 2m 22) 


A(x) 


The change in the potential over a distance equal to the de Broglie wavelength must be 
much smaller than the kinetic energy. This is the precise meaning of a slowly changing 
potential in the WKB approximation. 

The slow variation conditions needed for the basic WKB solutions to be accurate fail 
near turning points. This could be anticipated since at turning points the local momentum 
becomes zero and the de Broglie wavelength becomes infinite. Under general conditions, 
sufficiently near a turning point the potential V(a) is approximately linear, as shown in 
Figure 3.1. We then have 


V(a)-E = g(a—-a), g>0. (3.2.33) 
In the allowed region x < a the local momentum is 


p? = 2m((E-—V(2)) = 2mg(a— 2). (3.2.34) 
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Figure 3.1: The slow-varying conditions in WKB are violated at turning points. 


As a result, the de Broglie wavelength is given by 


27h 27h 
r = = ‘ 3.2.35 
(x) F Pan gee ( ) 
Taking a derivative, we find 
dX Th 1 
= ‘ 3.2.36 
ral /2mg (a — x)3/2 ( ) 


The right-hand side goes to infinity as x — a and therefore the key condition (3.2.28) is 
violated as we approach turning points. Our basic WKB solutions can be valid only as 
long as we remain away from turning points. If we have a turning point, such as x = a in 
the figure, we need a ‘connection formula’ that tells us how a solution far to the left and a 
solution far to the right of the turning point are related when they together form a single 
solution. We will consider a connection formula right below. 


3.3. Connection formula 


Let us first explain and then use a set of connection formulae, leaving their derivation for 
next section. The connection formulae refer to solutions away from a turning point x = a 
separating a classically allowed region to the left and a classically forbidden region to the 
right. This is the situation illustrated in Figure 3.2. 

For such turning point at « = a the WKB solutions to the right are exponentials that 
grow or decay and the WKB solutions to the left are oscillatory functions. They connect 
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Figure 3.2: 


consistent with the following relations 


we ( i. Bic 4 = ae (- i, : x(a!) (3.3.1) 


- a sin es k(2")da" — 4 = a5 exp ak x(a')ae' (3.3.2) 


In the first relation the arrow tells us that if the solution is a pure decaying exponential to 
the right of « = a the solution to the left of = a is accurately determined and given by 
the phase-shifted cosine function the arrow points to. The second relation says that if the 
solution to the left of x = a is of an oscillatory type, the growing part of the solution to 
the right of « = a is accurately determined and given by the exponential the arrow points 
to. The decaying part cannot be reliably determined. As we will elaborate upon later, any 
connection formula is not to be used in the direction that goes against the arrow. 


Example: Find a quantization condition for the energies of bound states in a monoton- 
ically increasing potential V(x) that has a hard wall at « = 0. Assume V() increases 
without bound, as illustrated in Figure 3.3. 

Let E denote the energy of our searched-for eigenstate. Clearly, the energy and the 
potential V(a) determine the turning point x = a. The solution for x > a must only have a 
decaying exponential since the forbidden region extends forever to the right of r = a. The 
wavefunction for x > a is therefore of the type on the right-side of the connection formula 
(3.3.1). This means that we have an accurate representation of the wavefunction to the left 
of x =a. Adjusting the arbitrary normalization, we have 


v(x) = oa cos ( [0 Heaae' - +) ; Ue <a. (3.3.3) 


3.3. CONNECTION FORMULA 61 


5 
| 
x @ 
S MWA 


Figure 3.3: A monotonically increasing potential with a hard wall at x = 0. For an energy eigenstate 
of energy & the turning point is at 7 =a. 


Since we have a hard wall, the wavefunction must vanish for z = 0. The condition 7(0) = 0 


requires that 


cosA =), ‘with A= i k(a')da' — 4. (3.3.4) 
This is satisfied when 


a= k(a')de’ —-F = Finn, neZ. (3.3.5) 
0 


The quantization condition is therefore 


ii Kade = (n+ 2) 9; n= 0;1,2)4.: (3.3.6) 
0 


Negative integers are not allowed because the left-hand side is manifestly positive. The 
above is easy to use in practice. Using the expression for k(x) in terms of EF and V(x) we 


have 


[Reveal =+ 9m, n=0,1,2,... (3.3.7) 
0) 


In some special cases a can be calculated in terms of F and the integral can be done 
analytically. More generally, this is done numerically, exploring the value of the integral 
on the left hand side as a function of F and selecting the energies for which it takes the 


quantized values on the right-hand side. 
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Let us rewrite the wavefunction using f” = fy‘ — Jo: 


ies: ae ( i " R(a!)de! — ® i “k(e!\ae 
= Jes (a zs / *k(e!)ae' (3.3.8) 
= aa i ( / k(x')ae' 


where we expanded the cosine of a sum of angles and recalled that cos A = 0. In this form 
it is manifest that the wavefunction vanishes at x = 0. More interestingly, the quantization 
condition (3.3.6) indicates that the excursion of the phase in the solution from 0 to a is a 
bit higher than nz (but less than (n + 1)7). Thus the WKB wavefuntion produces the n 
nodes the n-th excited state must have, even though this wavefunction is not reliable all to 


way tox =a. 


3.4 Airy functions and connection formulae 


| Vix) 
CLASS/ CALLY CLACE ICALLY 
gre = Foeerppen) _ 
(3.4.9) 
U<O __— Ur0 
_ = u -——— oo - SO 7 ie 
KE XY OW x 
V(a) -E~ g(a -a) with g > 0 forr~a (3.4.10) 
Solutions 
A ‘3 Wr 9 B a aI 
v(x) = @) exp (- K(x )av’) + ae exp ( K(x )a’) xz>a_ (3.4.11) 


ia = on ( ’, *k(a!)ae' + Ta? (-i i, “k(x! a<a (3.412) 
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where 
g 2m 2mg 
2m 2mg 
Die 
n= Sh(V(e)- B)~ (ea) 22a 
Idea: Solve Schrodinger equation 
h2 


2m 
with the exact linear potential, i.e. 


e Solve (3.4.16) exactly. 
e Match it to the x > a version of (3.4.11) or « < a version of (3.4.12). 


e Find the required relations between coefficients! 


Solve (3.4.16). Remove units, find length scale. 


so define 


thus (3.4.16) becomes 


with solution 
y= aA;(u) + BBi(u) 


where a and £ are constants and A;(u), B;(u) are Airy functions: 


11 |y|-teé ul, 
Aj(u) & aaa 

Fale teos(€-4F) w<-l. 

1 a2 & 1 

Fale 4e€ uUu> ) 
B;(u) ~ 


—Jzlul~# sin (6 — §) u<-1. 


with : 
€ = lu), 
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(3.4.13) 


(3.4.14) 


(3.4.15) 


(3.4.16) 


(3.4.17) 


(3.4.18) 


(3.4.19) 


(3.4.20) 


(3.4.21) 


(3.4.22) 


(3.4.23) 


These are in fact WKB solutions and special cases of connection conditions! (Explain) 
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3.4.1 How bad are the errors? 


Take a WKB solution and find the DE it satisfies 


1 Blu 
w(u) = —e 3!" (3.4.24) 
U4 
solves 
dy 5 1 
se 7 (Ut aoas)¥ = 0 (3.4.25) 


so clearly for u >> 1 the extra term is negligible. 
To evaluate (3.4.11) and (3.4.12), determine «(x) and k(x). Using (3.4.13) and (3.4.14): 


k? = 9 (a—2) = —1u = 7? (ul u<0 (3.4.26) 
K* = 7 (a — a) = +1*u u>0 (3.4.27) 


Using ' = +, the WKB condition is |k’| < |k?| From ... 


3 3 2 
1 1) 1) 
|2kk’| = 7° |k’| (3.4.28) 
2|k| nul |u| 
k? = 7 |u| thus 
/ 2 n? 2 3 1 
Two integrals to estimate 
x 1\ pf . ae ee 3|* 
Hae Ke jde = (a! — a)da’ = —n2 (a — a)? 
- “ 3 a 
2 2: Dis 
= gir (2 — a)2 = au? = glul? =< (3.4.30) 
a 3 2 3|° 
ie ee te i k(a’)da! = . (a — 2!)dx! = 2 [F(a — 23] 
av x x 
2 
= 32 (a — 2)? = <ul? = (3.4.31) 
both integrals are equal to €. 
1 1 1 1 
Sas oT (3.4.32) 
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(3.4.11) and (3.4.12), let {A, B,C, D} > 12{A, B,C, D} = to cancel the extra factor and 
produce the ... constants 


1 1 
jul" 4 _¢ jul"4 ¢ 
U(r) =A e °+ B——e u>il 
Vr Va 
= 2AA;(u) + BB;(u) (3.4.33) 
(a) _ ltl ee | pls -« u<-—-l (3.4.34) 
= Vir T Vit oa, 
Note that 
Ai Sg Be jul4 -i(e-g) (3.4.35) 
7 a Jn on 2 
i 
Ap B= as ei(é-F) (3.4.36) 
thus 
jul 
e' (A; —iB,) = Ti eff (3.4.37) 
i 
e'4(A;+iB;) = e eé (3.4.38) 
Now back in (3.4.34) 
w(x) = cee — iB;) + De~*4(A; + iB;) 
= (cel eases a) A ee (ide 2 iCe't) B; (3.4.39) 


2A = Ce't + De~*4 (3.4.40) 
iB =Ce'4 —De*t# => (3.4.41) 
1 jx 
C= rial (2A + iB) (3.4.42) 
Lge 
D = e'4 (2A — iB) (3.4.43) 


back in (3.4.12) 


1@) =a p(i fo aw’ ka ac -4)4 a = ex (i f"k(e!\ae! - 5) = 
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Oe a0 [ eleyae! =) aa ( [wade - ©) (3.4.44) 


Now discuss a subtlety. The formulas are in reality only usable in one direction with partial 
results. Some terminology helps. We always speak having in mind a fixed turning point. If 
we say we have a growing exponential at a turning point we mean we have a solution that 
grows as we move away from the turning point towards the classically forbidden region. 
Similarly, a decaying exponential means our solution decays as we move away from the 
turning point towards the classically forbidden region. In summary the connection arrow 
can go from a decaying exponential into an oscillatory solution and can go from an oscilla- 
tory solution into a growing exponential. 


3.4.2 The other connection conditions 


This time the turning point is at x = b, with the classically allowed region to the right of it 
and the classically forbidden region to the left of it. 


ae (- ) “s(a!\da') =% awe ( i ” k(a!)de! — 4 (3.4.45) 


= = exp Gi x(e')ae = 6 sin (f° k(2')da! — *) (3.4.46) 


For the first relation, if there is only a decaying exponential to the left, the wavefunction 
is definitely without the growing exponential, and the wavefunction to the right of x = 6 is 
accurately determined. This does not work in the reverse direction because a small error in 
the oscillatory function would give a growing exponential to the left of for x = b that would 
overtake the predicted decaying solution. 

For the second relation, the stated direction (from right to left) is reliable: an small 
error on the right side of x = b would produce a decaying exponential, thus a small error 
on the left-side of x = b. The reverse direction does not work: a small error to the left of 
x = b, a decaying exponential, would produce a large error in the right-hand side. 
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In summary, in all cases of the connection conditions, relative to a turning point, 


We can connect away from a decaying exponential and into a growing exponential. 


(3.4.47) 


3.5 Tunnelling through a barrier 


Figure 3.4: Tunneling in the WKB approximation. The energy F must be smaller than the height 
of the potential, and the barrier must be wide and smooth. 


We want to find the tunneling probability T for a wave of energy incident on a smoothly 
varying wide barrier. The energy FE of the wave must be smaller than the height of the 
potential. In this problem there is an incident wave, a reflected wave and a transmitted 
wave, and the associated WKB expressions will have amplitudes controlled by constants 
A, B, and F’, respectively. The situation is illustrated in Figure 3.4. 

Remarks: 


1) Expect T to be small. Little probability flux goes through and |A| ~ |B]. 
2) For « > 6 there is just an outgoing wave with amplitude controlled by F’. 


3) Within the barrier the component that decays as x grows is more relevant that the 
component that grows. It is the only component that can be estimated reliably. 


Consider the transmitted right-moving wave valid for x >> b 
F 
VK(2) 


This is a WKB solution with an extra phase of 7/4 chosen to make the argument of the 
exponential have the familiar form appearing in the connection formulae. Expanding the 


w(x) = exp ( [ k(a’)dx' — ix) oe Sb (3.5.1) 
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exponential 


Vr(a) = HS a Cx k(a")de! — x) ae a ca kG ie" = x) wd. (3.5.2) 


This is in standard form. We can now match the second term to an exponential that grows 
as we move to the left of x = b, by using (3.4.46) 


b 
U(L)barr = — exp (/ (eae ee OO (3-553) 


iF 
K(x) 
The subscript ‘barr’ indicates a solution in the barrier region. If we attempted to match 
the first term in (3.5.2) we would get an exponential that decays as we move to the left of 
x = b and it is unreliable given the growing exponential. We can now refer this solution to 
the point 7 = a 


: b x. 
W(X)barr = — = exp ( K(2")da! — | x(a')ae' ax<nx<b. (3.5.4) 


K(x) 


Defining 
b 
= / sala der (3.5.5) 


we have 
iFe? Pe eee & 
Ct 5 exp i K(a' )dx ax<xr<b. (3.5.6) 


K(x 


Since this is a decaying exponential to the right of x > a we can connect it to a solution to 
the left of « < a using (3.3.1) 


Heys mao ae €h nen res +) (3.5.7) 


This is a superposition of two waves, a wave Winc incident on the barrier and a wave ref 
reflected from the barrier. The incident part is 


iFe? 

VK(z) 
The sign in front of the integral may seem unusual for a wave moving to the right, but 
it is correct because the argument x appears in the lower limit of the integration. The 


transmission coefficient T is the ratio of the transmitted probability current over the incident 
probability current. Given the result in (3.2.19) we have 


Vine(@) = exp ( if k(e!\as! + if) (3.5.8) 


= probability current for q, = Paks — 6-20 (3.5.9) 
probability current for Wine | — iF'e?|? , ni 
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This is the well-known exponential suppression of the transmission coefficient. Using the 
earlier definition of @ the result is 


Twkb = exo (2 f K(a!)dz') . (3.5.10) 


We added the subscript ‘wkb’ to emphasize that this is the WKB approximation to the 
exact transmission coefficient. The integral extends in between the two turning points and 
captures information about the height and the width of the barrier. The integrand, as well 


as the turning points, depend on the energy as we can display by using the explicit value 
of K(x) : 


jee exp (2 [ amv) 2) dn!) (3.5.11) 


The WKB approximation only captures the exponentially decaying part of the transmission 
coefficient. There are corrections that are written as a prefactor to the exponential. These 
are not determined in this approximation. 


Example. Use the WKB approximation to estimate the transmission probability T for the 
rectangular barrier 


Vo ft <a, 
Vga OEE (3.5.12) 
0, otherwise. 


Assume the barrier is large. In terms of the familiar unit-free constant zo used to characterize 
square wells, this means that 

2 2mVoa? 

Zz — eo tt es 


See oo (3.5.13) 


Large barrier means large zp or large Voa?. Moreover we assume FE smaller than Vo; we 
cannot expect the approximation to work as the energy approaches the top of the barrier. 
Compare the WKB result for T with the exact result. 

Solution. The WKB estimation, using (3.5.11) is immediate. Since the barrier extends 
from x = —a to x = a and the potential is constant we have 


OF tD, 
Twkb = exp(-2 [ qr (2) da’) (3.5.14) 


The integral can be done and we find 


fae = exp(-4 Vim(W— B) i (3.5.15) 


This is the answer in the WKB approximation. In terms of zp this reads 


E 
Twkb & exp(—4201 oa ) : (3.5.16) 
Vo 
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The validity of the WKB approximation requires the argument in the exponent to be large. 
This is why we need zo > 1 and the energy not to approach Vo. When FE is very small 
compared to Vo, we have 

T ~ exp(—4z0). (3.5.17) 


Since zo ~ a\/Vo, this result is the basis for the claim that the exponential suppression of 
the tunneling probability is proportional to the width of the well and the square root of its 
height. 

The exact formula for the tunneling probability of a square barrier has been calculated 
before, and it is given by 


1 Ve 
— = 1+ 
i 4E(Vo — E) 


sinh? (—* V2m(Vo — E)) (3.5.18) 


This formula will allow us to confirm the WKB exponential suppression and to find the 
prefactor. Under the conditions zp >> 1 and Vo — F finite, the argument of the sinh function 
is large so this function can be replaced by its growing exponential (sinh x ~ 5e"): 


1 \ 4a 
7 14 2 exp( Pa VJ 2m(Vo — B)) ' (3.5.19) 


16E(1- 


where we also replaced Vo — E — Vo in the prefactor. The additive unit on the right-hand 
side can be neglected and we have 


T~ 6w£(1-€) exp(- am(Vo — E)) (3.5.20) 


This result gives the same in exponential suppression as the WKB result (3.5.15). It also 
gives us a good approximation to the prefactor. 


The WKB approximation is often used for the estimation of lifetimes. The physical 
situation is represented in Figure 3.5. We have a particle of mass m and energy EF localized 
between the turning points x = a and x = b of the potential. The classical particle cannot 
escape because of the energy barrier stretching from x = b to x = c. The quantum particle, 
however, can tunnel. The state of the quantum particle when localized between a and b 
is not strictly an energy eigenstate because the wavefunction for any energy eigenstate is 
nonzero for x > c. Our goal is to determine the lifetime 7 of the particle. This is done by 
using some classical estimates as well as the quantum transmission probability. 

We first show that the lifetime 7 is the inverse of 6, the tunneling probability per unit 
time. To see this we examine the function P(t) that represents the probability to have the 
particle localized within [a,b] at time t, if it was there at t = 0. The lifetime is said to be + 
if P(t) has the time dependence P(t) = e~/7. To see that we have such a situation when 
the tunneling rate B is constant note that 


P(t+dt) = P(t)- (probability that it does not tunnel in the next dt). (3.5.21) 
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Figure 3.5: Estimating the lifetime of a particle with energy E temporarily localized in x € [a, b]. 


Therefore 
P(t+dt) = P(t)-(1-Bdt) + P(t+dt)— P(t) = —BP(t)dt. (3.5.22) 
We thus have the differential equation, and solution, 
dP 


a SBP). a PO) oo, (3.5.23) 
from which we identify, as claimed, that 
1 
T= =. (3.5.24) 


The tunneling rate 6 or tunneling probability per unit time is estimated by counting the 
number of times nypiz the classical particle, bouncing between x = a and x = BJ, hits the 
barrier at x = b per unit time, and multiplying this number by the tunneling probability T: 


T 
=mtl = —, 3.5.25 
B=nnit At ( ) 
where At is the time the classical particle takes to go from 6 to a and back to b: 
b b 
d d. 
At = ay ec 2m | a. (3.5.26) 
a U(x) a P(Z) 
We now can put all the pieces together, using the WKB approximation for 7’. We find 
At / > dz . 
T= —y 2m —_— -exp(2 f K(x dw) 3.5.27 
T 1 Pa) a ae 


The smaller the tunneling probability, the larger the lifetime of the localized state. Note 
that the factor At carries the units of the result. The above result can easily fail to be 
accurate, as the WKB approximation for T does not give the prefactor multiplying the 
exponential suppression. 


Chapter 4 


Time Dependent Perturbation 
Theory 


© B. Zwiebach 


4.1 Time dependent perturbations 


We will assume that, as before, we have a Hamiltonian H©) that is known and is time 
independent. Known means we know the spectrum of energy eigenstates and the energy 
eigenvalues. This time the perturbation to the Hamiltonian, denoted as dH(t) will be time 
dependent and, as a result, the full Hamiltonian H(t) is also time dependent 


H(t) = H© + 6H(t). (4.1.1) 


While H© has a well-defined spectrum, H(t) does not. Being time dependent, H(t) does 
not have energy eigenstates. It is important to remember that the existence of energy 
eigenstates was predicated on the factorization of solutions (x,t) of the full Schrédinger 
equation into a space-dependent part v(x) and a time dependent part that turned out to be 
eiEt/h with E the energy. Such factorization is not possible when the Hamiltonian is time 
dependent. Since H(t) does not have energy eigenstates the goal is to find the solutions 
|W(x,t)) directly. Since we are going to focus on the time dependence, we will suppress the 
labels associated with space. We simply say we are trying to find the solution |W(t)) to the 
Schrédinger equation 


in w(t) = (H + 6H(t))|(o). (4.1.2) 

In typical situations the perturbation 6H (t) vanishes for t < to, it exists for some finite 
time, and then vanishes for t > tr (see Figure 4.1). The system starts in an eigenstate of 
H©) at t < to or a linear combination thereof. We usually ask: What is the state of the 
system for t > ts? Note that both initial and final states are nicely described in terms of 
eigenstates of H) since this is the Hamiltonian for t < to and t > t f Even during the 
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Figure 4.1: Time dependent perturbations typically exist for some time interval, here from to to tf. 


time when the perturbation is on we can use the eigenstates of H) to describe the system, 
since these eigenstates form a complete basis, but the time dependence is very nontrivial. 

Many physical questions can be couched in this language. For example, assume we have 
a hydrogen atom in its ground state. We turn on EM fields for some time interval. We can 
then ask: What are the probabilities to find the atom in each of the various excited states 
after the perturbation turned off? 


4.1.1 The interaction picture 


In order to solve efficiently for the state |W(t)) we will introduce the Interaction Picture of 
Quantum Mechanics. This picture uses some elements of the Heisenberg picture and some 
elements of the Schrédinger picture. We will use the known Hamiltonian H ©) to define 
some Heisenberg operators and the perturbation 6H will be used to write a Schrodinger 
equation. 

We begin by recalling some facts from the Heisenberg picture. For any Hamiltonian, 
time dependent or not, one can determine the unitary operator U/(t) that generates time 
evolution: 


Ve) = U(H|VO)). (4.1.3) 


The Heisenberg operator Ay associated with a Schrédinger operator A, is obtained by 
considering a rewriting of expectation values: 


(U(E)As|Y(A)) = (W(O)|U (ASU (t)|¥O)) = (H(0)|An|¥(0)), (4.1.4) 


where 


Ay = U'(t)A,U(t). (4.1.5) 


This definition applies even for time dependent Schrédinger operators. Note that the oper- 
ator Ut brings states to rest: 


Ut (t)|W(t)) = UU" (#)LA(E)|W(0)) = |¥(0)) (4.1.6) 


In our problem the known Hamiltonian H) is time independent and the associated unitary 
time evolution operator Uo(t) takes the simple form 


a) . 


Uo(t) = exp (- i 


(4.1.7) 
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The state |W(t)) in our problem evolves through the effects of H (°) plus 6H. Motivated 
by (4.1.6) we define the auxiliary ket |U(t)) as the ket |W(¢)) partially brought to rest 
through H): 


& {HO 
\(#)) = exp (“J \wm). (4.1.8) 


Expect that Schrodinger equation for |W (t)) will be simpler, as the above must have taken 
care of the time dependence generated by H®). Of cours, if we can determine | W(t) we 
can easily get back the desired state | U(t)) inverting the above relation to find 


(HOR, ~ 
|W(t)) = exp (- < “)\HH). (4.1.9) 


Our objective now is to find the Schrédinger equation for |@(t)). Taking the time 
derivative of (4.1.8) and using (4.1.2) 


~ me iH) 
in H()) = HO |B) + exp (A) a7 + sn H)|¥) 


= | HO 4 exp (24) (H + 5H1(4)) exp (| |W (t)) 


h 


a us ~*) a(t), (4.1.10) 


= exp (——)omw exp (- i 


where the dependence on H) cancelled out. We have thus found the Schrédinger equation 


in |H(0)) = 5H(t)|W(t)). (4.1.11) 


where the operator 6H (t) is defined as 


jHO? 


6H(t) = exp ( a 


)oH(e) exp (- (4.1.12) 


Note that as expected the time evolution left in |W(t)) is generated by oH (t) via a Schrodinger 
equation. The operator oH (t) is nothing else but the Heisenberg version of 6H generated 
using H©! This is an interaction picture, a mixture of Heisenberg’s and Schrodinger pic- 
ture. While we have some Heisenberg”) operators there is still a time dependent state 
|v (t)) and a Schrédinger equation for it. 
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How does it all look in an orthonormal basis? Let |n) be the complete orthonormal basis 
of states for H(): 
Hn) = E,|n). (4.1.13) 


This time there is no need for a (0) superscript, since neither the states nor their energies 
will be corrected (there are no energy eigenstates in the time-dependent theory). We then 
write an ansatz for our unknown ket: 


|W(t)) = So en(t)|n). (4.1.14) 


Here the functions c,(t) are unknown. This expansion is justified since the states |n) form 
a basis, and thus at all times they can be used to describe the state of the system. The 
original wavefunction is then: 


|W (t)) = exp (eke |B(t)) = So en(t) exp (-5") In) . (4.1.15) 


Note that the time dependence due to H (0) is present here. If we had 6H = 0 the state 
| W(t) would be a constant, as demanded by the Schrédinger equation (4.1.11) and the 
Cn(t) would be constants. The solution above would give the expected time evolution of the 
states under H, 

To see what the Schrédinger equation tells us about the functions c,(t) we plug the 
ansatz (4.1.14) into (4.1.11): 


im em(t)im) = dH(t) S~ en(t)|n) (4.1.16) 


Using dots for time derivatives and introducing a resolution of the identity on the right-hand 
side we find 


Do them (tlm) = 7 bm) (m5) YT en (/)In) 
= So lm) Do (m|dH(E)In)en(t) 


= S~ 5 Hmn(t)en(t)|m) . (4.1.17) 


myn 
Here we have used the familiar matrix element notation 
SH mn(t) = (m|dH(t)|n). (4.1.18) 
Equating the coefficients of the basis kets |m) in (4.1.17) we get the equations 


hen(t) = ¥ 0Hmadene)s (4.1.19) 
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The Schrédinger equation has become an infinite set of coupled first-order differential equa- 
tions. The matrix elements in the equation can be simplified a bit by passing to un-tilde 
variables: 


~ (HOt iHOt 
OH) = (m| exp ( z )oH(e) exp (- Ps In) 
(4.1.20) 
= exp (Em = Fa) (mldH(t)\n) . 
If we define E E 
mn =——, 4.1.21 
, - (4.1.21) 
we then have ae 
Ot) = see oH as), (4.1.22) 
The coupled equations (4.1.19) for the functions c,,(t) then become 
then(t) = Yer" Oo Halt)en(t) (4.1.23) 


4.1.2 Example (based on Griffiths Problem 9.3) 


Consider a two-state system with basis states |a) and |b), eigenstates of H©) with energies 
Eq and Ep, respectively. Call 
Wab = (Ea = Ey) /h . (4.1.24) 


Now take the perturbation to be a matrix times a delta function at time equal zero. Thus 
the perturbation only exists for time equal zero: 


0 a 
ax 0 


5H(t) = ‘ ) 5(t) =U d(t), (4.1.25) 


where qa is a complex number. With the basis vectors ordered as |1) = |a) and |2) = |b) we 


have 
- te Ae » with OG, = Uy —0and VU, = Usa =a. (4.1.26) 


There is a sudden violent perturbation at ¢ = 0 with off-diagonal elements that should 
produce transition. Take the system to be in |a) for t = —oo, what is the probability that 
it is in |b) for t = +00? 


Solution: First note that if the system is in |a) at t = —oo it will remain in state |a) until 
t = 0°, that is, just before the perturbation turns on. This is because |a) is an energy 
eigenstate of H©). In fact we have, up to a constant phase, 


[wit)) = e-*Pet/*g), for —0co<t<0-. (4.1.27) 
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We are asked what will be the probability to find the state in |b) at t = oo, but in fact, the 
answer is the same as the probability to find the state in |b) at t =0*. This is because the 
perturbation does not exist anymore and if the state at t = 07 is 


|W(OF)) = yala) + |b) , (4.1.28) 
with 7, and y2 constants, then the state for any time t > 0 will be 
|UQ) = ryalayePat/® + yp|bye*Pe/™ (4.1.29) 
The probability p,(t) to find the state |b) at time t will be 
polt) = (VE)? = [we */"| = re? (4.1.30) 


and, as expected, this is time independent. It follows that to solve this problem we must 
just find the state at t = 0* and determine the constants 7 and 9. 
Since we have two basis states the unknown tilde state is 


|W (t)) = ca(t)|a) + cp(t)|d) , (4.1.31) 
and the initial conditions are stating that the system begins in the state |a) are 

BO" reels. 60s.) = 0% (4.1.32) 
The differential equations (4.1.23) take the form 


ihég(t) = eS Hap (t)en(t) , 


(4.1.33) 
ihe (t) = eS Hyg (t)cq(t) . 


The couplings are off-diagonal because 6Hgq = 6H», = 0. Using the form of the 6H matrix 
elements, 


ihég(t) = ett @6d(t) o(t), 


: (4.1.34) 
thé(t) = eb" a*5(t) ca(t). 

We know that for functions f continuous at t = 0 we have f(t)d(t) = f(0)d(t). We now 
ask if we are allowed to use such identity for the right-hand side of the above equations. In 
fact we can use the identity for e+" but not for the functions cqa(t) and c(t) that, are 
expected to be discontinuous at t = 0. They must be so, because they can only change at 
t = 0, when the delta function exists. Evaluating the exponentials at t = 0 we then get the 
simpler equations 


ihég(t) = ad(t) a(t), 


-_ (4.1.35) 
ihe(t) = a*d(t) ca(t). 
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With such singular right-hand sides, the solution of these equations needs regulation. We 
will regulate the delta function and solve the problem. A consistency check is that the 
solution has a well-defined limit as the regulator is removed. We will replace the d(t) by 
the function A;,,(t), with to > 0, defined as follows: 


1/to, for t € [0, to] , 


; (4.1.36) 
0, otherwise. 


Note that as appropriate, { dtA,,(t) = 1, and that A,,(t) in the limit as tg + 0 approaches 
a delta function. If we replace the delta functions in (4.1.37) by the regulator we find, for 
€ (0, to] 


Fort €[0,to]: — thea(t) = > a(t), 
= (4.1.37) 
ihe(t) = ~ eq (t). 
to 


For any other times the right-hand side vanishes. By taking a time derivative of the first 
equation and using the second one we find that 


7 lala‘ |a| 
w(t) = ——— a(t) = a(t) . 4.1.38 
is () ii ih ty (#) =() s (#) ( ) 
This is a simple second order differential equation and its general solution is 
t 
Ca(t) = Bo cos & alt 7) + Bis in( al 2 (4.1.39) 
hto 
This is accompanied with c(t) which is readily obtained from the first line in (4.1.37): 
ifto . el n( Bt) os( oF =) 
= w(t) = = : 4.1.40 
a(t) = Sete) = “Sl (—posin( BE) + 1 c05 (Fe (4.1.40) 


The initial conditions (4.1.32) tell us that in this regulated problem cq(0) = 1 and c,(0) = 0. 
Given the solutions above, the first condition fixes 89 = 1 and the second fixes 6; = 0. Thus, 
our solutions are: 


For t € [0, to] : Cali) = 008 (e) : (4.1.41) 
hto 
ae me ale (*) (4.1.42) 
a hto 


When the (regulated) perturbation is turned off (t = to), cq and cy stop varying and the 
constant value they take is their values at t = to. Hence 


Cat >to) =ca( to) = cos (4!) : (4.1.43) 
elt >to) = gto = lies sin (4) : (4.1.44) 
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Note that these values are to independent. Being regulator independent, we can safely take 
the limit to > 0 to get 


Ca(t > 0) = cos (4) : (4.1.45) 
lal . fla 
c(t >0) = -;fal sin (4) : (4.1.46) 
Our calculation above shows that at t = 0* the state will be 

|w(0+)) = |W(0+)) = cos (4) | a) io sin (4) |). (4.1.47) 

With only free evolution ensuing for t > 0 we have (as anticipated in (4.1.29)) 
|W(t)) = cos (4) | ane ee jlo sin (4) |) ere, (4.1.48) 

a 


The probabilities to be found in |a) or in |b) are time-independent, we are in a superposition 
of energy eigenstates! We can now easily calculate the probability p,(t) to find the system 
in |b) for t > 0 as well as the probability p,(t) to find the system in |a) for t > 0: 


iorw(e)? = sin? (FE) 


Ka} W(H)|? = eos? (IE) 


po(t) 
(4.1.49) 


Pa(t) 


Note that pa(t)+pp(t) = 1, as required. The above is the exact solution of this system. If we 
had worked in perturbation theory, we would be taking the strength |a| of the interaction 
to be small. The answers we would obtain would form the power series expansion of the 
above formulas in the limit as |a|/h is small. 


4.2 Perturbative solution 


In order to set up the perturbative expansion properly we include a unit-free small parameter 
\ multiplying the perturbation 6H in the time-dependent Hamiltonian (4.1.1): 


H(t) = H© + dH (t). (4.2.1) 


With such a replacement the interaction picture Schrédinger equation for |W(t)) in (4.1.11) 
now becomes 


in“ |H(t)) = SH (t)/H(t)). (4.2.2) 


As we did in time-independent perturbation theory we start by expanding | w(t) in powers 
of the parameter A: 


|B) = WOM) 4A, WV@) 427/B%H) +0 (3) . (4.2.3) 
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We now insert this into both sides of the Schrédinger equation (4.2.2), and using 0; for time 
derivatives, we find 


ihd,|VO (t)) + Ach, |W (t)) + A? iHd,| GO ()) + 3 sha, | GO (t)) + O (a4) 


(4.2.4) 
= A\6H|WO(t)) +7 6H |WOA)) +9 5H |W (e)) +0 (r4) . 
The coefficient of each power of \ must vanish, giving us 
ind, |VOt)) = 0, 
ind, WO(t)) = SH)WOA)), 
ind,|W(t)) = SH|wV(A)), (4.2.5) 


in, |VO*M (t)) = SH|W (Ly). 


The origin of the pattern is clear. Since the Schrodinger equation has an explicit A multi- 
plying the right-hand side, the time derivative of the n-th component is coupled to the 6H 
perturbation acting on the (n — 1)-th component. 

Let us consider the initial condition in detail. We will assume that the perturbation 
turns on at t = 0, so that the initial condition is given in terms of |W(0)). Since our 
Schrédinger equation is in terms of |W(t)) we use the relation ((4.1.8)) between them to 
conclude that both tilde and un-tilde wavefunctions are equal at t = 0: 


|@(0)) = |W(0)). (4.2.6) 
Given this, the expansion (4.2.3) evaluated at t = 0 implies that: 
|@(0)) = |w(o)) = [WO 0)) +rAWOO)) +7])HR0)) +.0 (A) . (4.2.7) 


This must be viewed, again, as an equation that holds for all values of X. As a result, the 
coefficient of each power of A must vanish and we have 


YO) =|¥O), 


- (4.2.8) 
[WPO)) HO; wT 2 8a 
These are the relevant initial conditions. 

Consider now the first equation in (4.2.5). It states that je (t)) is time independent. 
This is reasonable: if the perturbation vanishes, this is the only equation we get and we 
should expect |W (t)) constant. Using the time-independence of Paw (t)) and the initial 
condition we have 


VO) = |W (0)) =|"), (4.2.9) 
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and we have solved the first equation completely: 


[oO = |wo)). (4.2.10) 


Using this result, the O(A) equation reads: 
indy] VM (t)) = 6H | VO (t)) = SA(A)|W(0)) (4.2.11) 


The solution can be written as an integral: 


WO) = [ 2) |W (0)) de’. (4.2.12) 
0 2 


Note that by setting the lower limit of integration at t = 0 we have implemented correctly 
the initial condition |V“(0)) = 0. The next equation, of order \? reads: 


ind,| V(t) = bw), (4.2.13) 
and its solution is eae 
\F2(n)) = / mt ) \BOW)) ad’, (4.2.14) 


consistent with the initial condition. Using our previous result to write we )) we now 
have an iterated integral expression: 


0 a ih 


It should be clear from this discussion how to write the iterated integral expression for 
|w*) (t)), with k > 2. The solution, setting 1 = 1 and summarizing is then 


; HO) 
(— t 


|U(e)) = exp ) to + POH) + WOH) +- ) (4.2.16) 


Let us use our perturbation theory to calculate the probability Py ~,»(t) to transition 
from |n) at t = 0 to |m), with m # n, at time t, under the effect of the perturbation. By 
definition 

Pmen(t) = |(m|¥(e))]”. (4.2.17) 


Using the tilde wavefunction, for which we know how to write the perturbation, we have 


Prne-n(t) = |(mle #4" Gt)y|? = |(m|¥(e))|?, (4.2.18) 
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since the phase that arises from the action on the bra vanishes upon the calculation of the 
norm squared. Now using the perturbation expression for |W(t)) (setting 4 = 1) we have 


es 2 : 
Prren(t) = I(r (ju) +|WM()) + |G) +. | (4.2.19) 
Since we are told that |W(0)) = |n) and |n) is orthogonal to |m) we find 


2 
| (4.2.20) 


eee Ce (m| G0) + (m|G2(t)) +... 


To first order in perturbation theory we only keep the first term in the sum and using our 
result for |W“) (¢)) we find 


(4.2.21) 


' (m|dH(t')|In) 
/ a 


t crr(4! a 
Pa) ,(®) = lm [Pinal = 


Recalling the relation between the matrix elements of 6H and those of 5H we finally have 
our result for the transition probability to first order in perturbation theory: 


Os ce Haasan eal 
PO) (t) = / elmnt OnE) a! (4.2.22) 
0 ih 
This is a key result and will be very useful in the applications we will consider. 
Exercise: Prove the remarkable equality of transition probabilities 
Phat) = Pea. (4.2.23) 


valid to first order in perturbation theory. 


It will also be useful to have our results in terms of the time-dependent coefficients c,,(t) 
introduced earlier through the expansion 


YO) = do en(t)In). (4.2.24) 


Since |W(0)) = |¥(0)) the initial condition reads 


|¥(0)) = S>en(0)|n) = |W(O)), (4.2.25) 


where we also used (4.2.9). In this notation, the c,(t) functions also have a \ expansion, 
because we write 7 
PC) =S>MOIn), &=0,1,2,... (4.2.26) 


n 
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and therefore the earlier relation 


|B(e)) = WOM) 4A, WOH) +.27|GD) +0 (9) . (4.2.27) 
now gives 
cn(t) = c(t) + Ac (t) + A2cP)(t) +... (4.2.28) 


Since |W) (¢)) is in fact constant we have 

c(t) = (0) = en (0), (4.2.29) 

where we used (4.2.25). The other initial conditions given earlier in (4.2.8) imply that 
(0) = 0, k=1,2,3.... (4.2.30) 


Therefore, using our result (4.2.12) for |e (t)) we have 


POW) =P (In) = / OE at’ en(0)In). (4.2.31) 
and as a result, 
(0) = (m|GO) =D f (mls) 6 aya. (4.2.32) 
We therefore have 
oe : 1 iwmnt! 6 Himn(t’) 
Aaya >: i, at’ eter Aron’) (0). (4.2.33) 


The probability P,,(t) to be found in the state |m) at time t is 
= 2 
Pr(t) = |(mjw(e))P = |imiHO))” =lem(4)P. (4.2.34) 
To first order in perturbation theory the answer would be (with A = 1) 


PY (t) = lem(0) + Dt)? 


4.2. 
= |em(0)|” + ¢m(0)*chD (€) + eb (t)*em(0) + O(SH?) . es 


Note that the ese (t)|? term cannot be kept to this order of approximation, since it is of 
the same order as contributions that would arise from c) (t). 
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4.2.1 Example NMR 


The Hamiltonian for a particle with a magnetic moment inside a magnetic field can be 
written in the form 


H=w-S, (4.2.36) 


where S is the spin operator and w is precession angular velocity vector, itself a function 
of the magnetic field. Note that the Hamiltonian has properly units of energy (wh). Let us 
take the unperturbed Hamiltonian to be 


H®) = wo Sz = £ woo : (4.2.37) 
For NMR applications one has wo ~ 500 Hz and this represents the physics of a magnetic 
field along the z axis. Let us now consider some possible perturbations. 
Case 1: Time independent perturbation Let us consider adding at t = 0 a constant 
perturbation associated with an additional small uniform magnetic field along the x axis: 
H=H+6H, with 5H =QS,, (4.2.38) 
For this to be a small perturbation we will take 
<u. (4.2.39) 


Hence, for the full Hamiltonian H we have w = (2,0, wo) 


H = (Q,0,w0) -(Se, Sy, Sz) (4.2.40) 
——" 


. ve J 


The problem is simple enough that an exact solution is possible. The perturbed Hamiltonian 
H has energy eigenstates |n;-), spin states that point with n = ial with energies +/|w}. 
These eigenstates could be recovered using time-independent non-degenerate perturbation 
theory using Q/wo as a small parameter and starting with the eigenstates |+) of H (0) 

In time-dependent perturbation theory we obtain the time-dependent evolution of initial 
states as they are affected by the perturbation. Recalling (4.2.12) we have 


|e (t)) = i Bate |W (0) ae! (4.2.41) 
0 v 
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The calculation of 6H requires a bit of computation. One quickly finds that 


5H (t) = exp [iewot | QS, exp | -iwot | =o (5. cos wot — Sy sin wot) . (4.2.42) 
As a result we set: 
= Of? 7s F 
wO(t)) = = S, cos wot’ — Sy sinwot’ ) |W(0))de’ 
ih 0 # 
= ao E sin wot + ( cos(wot’) — 1) 5, |W(0)) . (4.2.43) 


As expected, the result is first order in the small parameter Q/wo. Following (4.2.16) the 
time-dependent state is, to first approximation 


| W(t) = exp | iwot | (: | i E sin wot + (cos(wot’) — 154] 1s) 


+ O((Q/w)*). 


(4.2.44) 


Physically the solution is clear. The original spin state |©(0)) precesses about the direction 
of w with angular velocity w. 


Case 2: Time dependent perturbation. Suppose that to the original Hamiltonian we 
add the effect of a rotating magnetic field, rotating with the same angular velocity wo that 
corresponds to the Larmor frequency of the original Hamiltonian: 


6H (t) = (5. cos wot + S, sin wot) : (4.2.45) 


To compute the perturbation 6H we can use (4.2.42) with t replaced by —t so that the 
right-hand side of this equation is proportional to 6H: 


exp |-iwot | QS, exp [iwot | = (5. COS Wot + Sy sin wot) ‘ (4.2.46) 
Moving the exponentials on the left-hand side to the right-hand side we find 
QS, = exp [iene | Q (3. COS Wot + S, sin wot) exp | -iwot | . (4.2.47) 


The right-hand side is, by definition, 6H. Thus we have shown that 
SH(t) =2S8,, (4.2.48) 


is in fact a time-independent Hamiltonian. This means that the Schrodinger equation for 
|v) is immediately solved 


|u(t)) = exp| i 00) = exp [54 a). (4.2.49) 
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The complete and exact answer is therefore 


xp |- get |(t)) = exp iwot | exp ine | |w(0)). (4.2.50) 


This is a much more non trivial motion! A spin originally aligned along Z will spiral into 
the x,y plane with angular velocity Q. 


4.3 Fermi’s Golden Rule 


Let us now consider transitions where the initial state is part of a discrete spectrum but the 
final state is part of a continuum. The ionization of an atom is perhaps the most familiar 
example: the initial state may be one of the discrete bound states of the atom while the 
final state includes a free electron a momentum eigenstate that is part of a continuum of 
non-normalizable states. 

As we will see, while the probability of transition between discrete states exhibits pe- 
riodic dependence in time, if the final state is part of a continuum an integral over final 
states is needed and the result is a transition probability linear in time. To such probability 
function we will be able to associate a transition rate. The final answer for the transition 
rate is given by Fermi’s Golden Rule. 

We will consider two different cases in full detail: 


1. Constant perturbations. In this case the perturbation, called V turns on at t = 0 but 
it is otherwise time independent: 


H®) , for t<0, 
H= (4.3.1) 


7 H®O+V for t>0. 


This situation is relevant for the phenomenon of auto-ionization, where an internal 
transition in the atom is accompanied by the ejection of an electron. 
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2. Harmonic perturbations. In this case the time dependence of the perturbation 6H is 
periodic, namely, 
H(t) = H®) + 6H(t), (4.3.2) 
with 
0, for t<0, 


dH(t) = (4.3.3) 
2H'coswt, for t>0. 

Note the factor of two entering the definition of 6H in terms of the time independent 

H'. This situation is relevant to the interaction of electromagnetic fields with atoms. 


Before starting with the analysis of these two cases let us consider the way to deal 
with continuum states, as the final states in the transition will belong to a continuum. 
The fact is that we need to be able to count the states in the continuuum, so we will 
replace infinite space by a very large cubic box of side length LE and we will impose periodic 
boundary conditions on the wavefunctions. The result will be a discrete spectrum where 
the separation between the states can be made arbitrarily small, thus simulating accurately 
a continuum in the limit LZ — oo. If the states are energetic enough and the potential is 
short range, momentum eigenstates are a good representation of the continuum. 

To count states use a large box, which can be taken to be a cube of side L: 


We call LZ a regulator as it allows us to deal with infinite quantities (like the volume of 
space or the number of continuum states). At the end of our calculations the value of L 
must drop out. This is a consistency check. The momentum eigenstates ~(x take the form 


ciket pikyy pikez (4.3.4) 


v(x) = VB 


with constant k = (kz, ky, kz). It is clear that the states are normalized correctly 


ie WO)P@Px = pez =p, (4.3.5) 
The k's are quantized by the periodicity condition on the wavefunction: 
partly, z) = vaytL,z) = pa,y2z+L) = pla,y,z). (4.3.6) 
The quantization gives 


kygb=2mnz 2 Ldky = 2ndnz, 
ky = 2aty > Ldky = 20dny; (4.3.7) 
k,L=2nmn, —- Ldk, = 2rdn,. 
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Define AN as the total number of states within the little cubic volume element d°k. It 
follows from (4.3.7) that we have 


L 3 
AN = dnzdnydnz = (=) dk. (4.3.8) 


Note that AN only depends on d?k and not on k itself: the density of states is constant in 
momentum space. 

Now let d°k be a volume element determined in spherical coordinates k,6,¢ and by 
ranges dk, d0,dd. Therefore 


dk = k*dk sindd0d¢ = k*dk dO (4.3.9) 


We now want to express the density of states as a function of energy. For this we take 
differentials of the relation between the wavenumber & and the energy E: 


hee m 
C= pee 2 kdk = 7) dE. (4.3.10) 
Back in (4.3.9) we have 
Pk = k dE dQ, (4.3.11) 
hence 3 
AN = (=) ko dQdE. (4.3.12) 
Qn ica 
We now equate 
AN = p(E)dE, (4.3.13) 


where p(£) is a density of states, more precisely, it is the number of states per unit energy 
at around energy & and with momentum pointing within the solid angle dQ. The last two 
equations determine for us this density: 


Ya: 


With a very large box, a sum over states can be replaced by an integral as follows 


ya Ss fo o(E)dE (4.3.15) 


states 


where the dots denote an arbitrary function of the momenta k of the states. 
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4.3.1 Constant transitions. 


The Hamiltonian, as given in (4.3.1) takes the form 


H() for t <0 
H= (4.3.16) 


HO+V = fort>0. 


We thus identify H(t) = V for t > 0. Recalling the formula (4.2.33) for transition ampli- 
tudes to first order in perturbation theory we have 


t 
YMa= >> i at! elem! Ln (0). (4.3.17) 


To represent an initial state 7 at = 0 we take c,(0) = dn,;. For a transition to a final state 
f at time to we set m = f and we have an integral that is easily performed: 


; t 
to iw pit! 
(1) 1 pate Veer 
to) == Veer’ dt = — 4.3.18 
cy (to) ih Jo fee ih twp, ( ) 
Evaluating the limits and simplifying 
. . lw ito /2 . 40 
ge Gs ( us 5) ove Pe che: 
to) = fi (1- etn) = “A 43.1 
7 (to) Bead % Bees (ree | any 


The transition probability to go from 7 at t = 0 to f at t = to is then lee (to)|? and is 
therefore 


; it 
Asin? [== 2) 


Pe ip) = |V-s/?——--——+ : 4.3.20 
fe f (Ey = E;)? ( ) 


This first order result in perturbation theory is expected to be accurate at time to if 
Prei(to) < 1. Certainly a large transition probability at first order could not be trusted 
and would require examination of higher orders. 


To understand the main features of the result for the transition probability we examine 
how it behaves for different values of the final energy Ey. If Ey # E; the transition is said 
to be energy non-conserving. Of course, energy is conserved overall, as it would be supplied 
by the perturbation. If Ey = E; we have an energy conserving transition. Both are possible 
and let us consider them in turn. 


1. Ey # £;. In this case the transition probability Py_;(to) as a function of to is shown 
in Figure 4.2. The behavior is oscillatory with frequency |wy;. If the amplitude of the 
oscillation is much less than one 


A|Vp,| 


———__~ < l 4.3.21 
(BE; —E,)? ( ) 


4.3. FERMI’S GOLDEN RULE 91 


Pee U \ 


phy 4, 


\ ve) im 
Figure 4.2: The transition probability as a function of time for constant perturbations. 


then this first order transition probability Py,;(to) is accurate for all times to as it is 
always small. The amplitude is suppressed as | — E;| grows, due to the factor in 
the denominator. This indicates that the larger the energy ‘violation’ the smaller the 
probability of transition. This is happening because a perturbation that turns on and 
then remains constant is not an efficient supply of energy. 


2. Ey + Ej. In this limit wy; approaches zero and therefore 


it Es — E;)? 
sin? (“4% ~ Jane (4.3.22) 


It follows from (4.3.20) that 


2 
ie (4.3.23) 


The probability for an energy-conserving transition grows quadratically in time, and 
does so without bound! This result, however, can only be trusted for small enough to 
such that Pyi(to) <<a 


Note that a quadratic growth of Py,; is also visible in the energy non-conserving Ey # 
E; case for very small times to. Indeed, (4.3.20) leads again to lim4,.9 Pr—i(to) = 
[Vp5|7t2 /R?, while Ey # E;. This behavior can be noted near the origin in Figure 4.2. 


Our next step is to integrate the transition probability over the now discretized contin- 
uum of final states. Remarkably, upon integration the oscillatory and quadratic behaviors 
of the transition probability as a function of time will conspire to create a linear behavior! 

The sum of transition probabilities over final states is approximated by an integral, as 
explained in (4.3.15). We thus have 


. writ 
sin” (24% o 


So Ppealto) = f PresltoyolBabs =4 f Were aE ABs. (48.24) 
f a 
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We have noted that Pr; in the above integrand is suppressed as |E — E;| becomes large. 
We therefore expect that the bulk of the contribution to the integral will occur for a narrow 
range AE; of Ey near E;. Let us assume now that |V;;|? and p(Ey) are slow varying 
and therefore approximately constant over the narrow interval Ey (we will re-examine this 
assumption below). If this is the case we can evaluate them for Ey set equal to FE; and take 
them out of the integrand to find 


_ Val oe, — 
S> Prea(to) = 72 —P(Es = Ei) I(to), (4.3.25) 
f 
where the integral I(t) is given by 
1 ae it dw ¢; 
I(to) = | sin? Gas dE; = h [ sin® Gas ae (4.3.26) 
Wr; 2 2 Wr; 


As it stands, the integral extends from minus infinity to plus infinity. It is useful to plot the 
final integrand for I(t) as a function of the integration variable wy;. The result is shown 
in Figure 4.3 and exhibits a main central lobe followed by symmetrically arranged lobes of 
decreasing amplitude. The lobes are separated by zeroes occurring for wr; = 27k/to, with 
k integer. Note that for 


Figure 4.3: Plot of — sin? (4°), the integrand of I. 
fi 


The largest contribution to I(to) arises from the main lobe 
T 2 
In terms of energies this corresponds to the range 


oh oth 
Bu 2 eye Be (4.3.28) 
to to 
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We need this range to be a narrow, thus tp must be sufficiently large. The narrowness 
is required to justify our taking of the density of states p(E) and the matrix element |V,;|? 
out of the integral. If we want to include more lobes this is can be made consistent with a 
narrow range of energies by making to larger. 

The linear dependence of (tg) as a function of to is intuitively appreciated by noticing 
that the height of the main lobe is proportional to te and its width is proportional to 1/to. 
In fact, the linear dependence is a simple mathematical fact made manifest by a change of 
variables. Letting 


it t 
u= “i = du= 5 tuopi (4.3.29) 
so that ore : be as 
2 i t i 
I(to) = fl So! / eae (4.3.30) 
to Joo u2+ z 2 Jog U 
0 


making the linear dependence in tg manifest. The remaining integral evaluates to 7 and we 
thus get 
I(to) = 5 Ato . (4.3.31) 


Had restricted the integral to the main lobe, concerned that the density of states and matrix 
elements would vary over larger ranges, we would have gotten 90% of the total contribution. 
Including the next two lobes, one to the left and one to the right, brings the result up to 
95% of the total contribution. By the time we include ten or more lobes on each side we 
are getting 99% of the answer. For sufficiently large to is is still a narrow range. 

Having determined the value of the integral [(to) we can substitute back into our ex- 
pression for the transition probability (4.3.26). Replacing to by t 


Al V5; |? T Qn 
yo Peci(t) = Fel By) 5 ht = 5 |VpilP (By) t- (4.3.32) 
k 


This holds, as discussed before, for sufficiently large t. Of course t cannot be too large as it 
would make the transition probability large and unreliable. The linear dependence of the 
transition probability implies we can define a transition rate w, or probability of transition 
per unit time, by dividing the transition probability by t: 


w= : So Pra): (4.3.33) 
f 


This finally gives us Fermi’s golden rule for constant perturbations: 


2 
Fermi’s golden rule: w = =|Vpil?o(Bp) Ey = E;. (4.3.34) 


Not only is the density of states evaluated at the energy E;, the matrix element Vs; is also 
evaluated at the energy E£; and other observables of the final state, like the momentum. In 
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this version of the golden rule the integration over final states has been performed. The 
units are manifestly right: [Veil has units of energy squared, p has units of one over energy, 
and with an A in the denominator the whole expression has units of one over time, as 
appropriate for a rate. We can also see that the dependence of w on the size-of-the-box 
regulator L disappears: The matrix element 


Vpi = (f|V |i) ~ L-8?, (4.3.35) 


because the final state wavefunction has such dependence (see (4.3.4)). Then the L depen- 
dence in [Vea l? ~ L~° cancels with the L dependence of the density of states p ~ L?, noted 
in (4.3.14). 

Let us summarize the approximations used to derive the golden rule. We have two 
conditions that must hold simultaneously: 


1. We assumed that to is large enough so that the energy range 


nh nh 
E,- k= ap 2 Byt ko, (4.3.36) 
0 0 


with k some small integer, is narrow enough that p(E) and |V;;|* are approximately 
constant over this range. This allowed us to take them out of the integral simplifying 
greatly the problem and making a complete evaluation possible. 


2. We cannot allow to to be arbitrarily large. As we have 


S > Pre i(to) = w to, (4.3.37) 
f 


we must keep wtp < 1 for our first order calculation to be accurate. 


Can the two conditions on to be satisfied? There is no problem if the perturbation can 
be made small enough: indeed, suppose condition 1 is satisfied for some suitable to but 
condition 2 is not. Then we can make the perturbation V smaller making w small enough 
that the second condition is satisfied. In practice, in specific problems, one could do the 
following check. First compute w assuming the golden rule. Then fix to such that wtp is 
very small, say equal to 0.01. Then check that over the range ~ hi/tp the density of states 
and the matrix elements are roughly constant. If this works out the approximation should 
be very good! 


Helium atom and autoionization. The helium has two protons (Z = 2) and two 
electrons. Let H) be the Hamiltonian for this system ignoring the the Coulomb repulsion 


between the electrons: ; 


2 2 
HOM BS oy Bee 4.3.38 
2m 11 2m 19 ( ) 


4.3. FERMI’S GOLDEN RULE 95 


Here the labels 1 and 2 refer to each one of the two electrons. The spectrum of this 
Hamiltonian consists of hydrogenic states, with n1,n2 the principal quantum numbers for 
the electrons. The energies are then 


1 1 1 1 
2 


E,,,= ~\08.8 en 


-n 422 £,,- -54.4 (+3) = - Be 
| = ss 
Cu, = see byes —tle 
AS | 
aS ae Ea. 4% —S4-4e¥ 
—-- 2, | 
Ear y —!3-ev 
Bw,a 7 — G&G av 


Figure 4.4: Hydrogenic states in helium and continuum states of negative total energy. 


For the hydrogen atom we have bound states of negative energy and continuum states 
of positive energy that can be described to a good approximation as momentum eigenstates 
of the electron which is no longer bound to the proton. Since we have two electrons in the 
case of helium, there are continuum states in the spectrum with negative energy. 

This first happens for nj = 1 in the limit as ng 4 oo. For nz = 1 and ng = ow the 
second electron is essentially free and contributes no energy. Thus a continuum appears for 
energy Ej... = —54.4eV. This (1S)(co) continuum extends for all E > —54.4eV as the free 
electron can have arbitrary positive kinetic energy. The state (2.5)(co) is the beginning of 
a second continuum, also including states of negative energy. In general the state (nS) (oo) 
with n > 1 marks the beginning of the n-th continuum. In each of these continua one 
electron is still bound and the other is free. A diagram showing some discrete states and a 
couple of continua is given in Figure 4.4. 

Self-ionizing energy-conserving transitions can occur because discrete states can find 
themselves in the middle of a continuum. The state (2,9)?, for example, with two electrons 
on the 2S configuration and with energy E22 = —27eV is in the middle of the (1)(oo) 
continuum. We can view the original (25)? hydrogenic state as a t = 0 eigenstate of H (9) 
and treat the Coulomb repulsion as a perturbation. We thus have a total Hamiltonian H 
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that takes the form 


e2 


H= HOV, v= (4.3.40) 


[r=] 
The perturbation V produces “auto-ionizing” Auger transitions to the continuum. We have 
a transition from a state with energy Fy. = —27eV to a state of one bound electron with 
energy —54.4eV and one free electron with kinetic energy of 27eV. That final state is part 
of a continuum. Radiative transitions from (25)? to 1525, with photo-emission, are in fact 
a lot less likely than auto-ionizing transitions! 

We will not do the quantitative analysis required to determine lifetime of the (2) state. 
In general auto-ionization is a process in which an atom or a molecule in an excited state 
spontaneously emits one of the outer-shell electrons. Auto-ionizing states are in general 
short lived. Auger transitions are auto-ionization processes in which the filling of an inner 
shell vacancy is accompanied by the emission of an electron. Our example of the (2S)? 
state is an Auger transition. Molecules can have auto ionizing Rydberg states, in which the 
little energy needed to remove the Rydberg electron is supplied by a vibrational excitation 
of the molecule. 


4.3.2 Harmonic Perturbation 


It is now time to consider the case when the perturbation is harmonic. We will be able 
to derive a similar looking Fermi golden rule for the transition rate. The most important 
difference is that now the transitions are of two types. They involve either absorption of 
energy or a release of energy. In both cases that energy (absorbed or released) is equal to 
hw where w is the frequency of the perturbation. 

As indicated in (4.3.3) we have 


H(t) = H® + 6H(t), (4.3.41) 
where the perturbation 6H (t) takes the form 


0, for t<0, 


5H(t) = (4.3.42) 


2H’ coswt , for t>0. 


Here w > 0 and H’ is some time independent Hamiltonian. The inclusion of an extra factor 
of two in the relation between 6H and H’ is convenient because it results in a golden rule 
does not have additional factors of two compared to the case of constant transitions. 

We again consider transitions from an initial state 7 to a final state f. The transition 
amplitude this time From (4.2.33) 


(1) a ee re 
Cr =, : dt’ e'F* 6H pi (t). (4.3.43) 
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Using the explicit form of 6H the integral can be done explicitly 


(1) ae hae eee pe 
cy (to) = in I e 2H ¢, coswt dt 


H' . to : , ; / 
= a i (cltuntort 4+ ellwsi—w)t ) dt! (4.3.44) 
ee) 
TT; ei(writw)to _] eiwri—w)to —] 
ee wr tw - Wi — W , 


Comments: 


e The amplitude takes the form of a factor multiplying the sum of two terms, each one a 
fraction. As tg > 0 each fraction goes to itg. For finite to, which is our case of interest, 
each numerator is a complex number of bounded absolute value that oscillates in time 
from zero up to two. In comparing the two terms the relevant one is the one with the 
smallest denominator.! 


e The first term is relevant as wr; +w * 0, that is, when there are states at energy 
Ey = E; — hw. This is “stimulated emission”, the source has stimulated a transition 
in which energy hw is released. 


e The second term relevant if wf; — w *& 0, that is, when there are states at energy 
Ey; = FE; + hw. Energy is transferred from the perturbation to the system, and we 
have a process of “absorption”. 


Both cases are of interest. Let us do the calculations for the case of absorption; the 
answer for the case of spontaneous emission will be completely analogous. We take 7 to be 
a discrete state, possibly bound, and f to be a state in the continuum at the higher energy 
Ey ® E,+hw. Since wy; ~ w, the second term in the last line of (4.3.44) is much more 
important than the first. Keeping only the second term we have 


H'.. 5 (wri—w)to 
(1) fi€ 5% Wi —W 
to) = 2 ————t 4.3.45 
a (to) h A isin ( 5) 0) , ( ) 


and the transition probability is 


4H? sin? (“2=“t0) 

(1) 2 fi 

Prei(to) = t = 4.3.46 
fo ( 0) Icy ( o)| 2 (wi = w)? ( ) 


The transition probability is exactly the same as that for constant perturbations (see 
(4.3.20)) with V replaced by H’ and wy; replaced by wy; — w. The analysis that follows is 


'This is like comparing two waves that are being superposed. The one with larger amplitude is more 
relevant even though at some special times, as it crosses the value of zero, it is smaller than the other wave. 
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completely analogous to the previous one so we shall be brief. Summing over final states 


we have 


5 Prei(to) = le: «i P(E )dEs = / Aleit aa Gud) (Ey)dE (4.3.47) 
keillo iP p : 3. 
- f f f h2 (wi /)? f f 


This time the main contribution comes from the region 


2 2 
era ee (4.3.48) 


In terms of the final energy this is the band 
Qrh Qrh 
By + hw ——— < By < Ejthw+ (4.3.49) 
0 0 


For sufficiently large to this is the narrow band of states illustrated in Figure 4.5. Assume 


4th 


Figure 4.5: In the absorption process we must integrate over a narrow band of states about 
the final energy F; + hw. 


that over the band H Fi and the density of states is constant so that we get 


4 
Py i(to) = | H'p;|? 0( Ei + hw [Sewn 4.3.50 
» ilo) = F [ET pal" al ) Gon ( ) 


Defining 
u=Z(wfi—w)to 4 du= jZdwfito, (4.3.51) 
the integral in (4.3.50) becomes 


2 


co 4t,2 oO 
2 t t 
‘ ae = du = of = O du = eae (4.3.52) 
—o0 (24) to A GaP 2 

to 
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Finally, the transition probability is 
27 19 
Pesala = s |H pil o( Bi + fw) to. (4.3.53) 
k 


The transition rate w is finally given by 


2 
Fermi’s golden rule: w = = p(Ep)|H pl” , Ep= Ej +hw. (4.3.54) 


Here H’(t) = 2H’ coswt. Equation (4.3.54) is known as Fermi’s golden rule for the case of 
harmonic perturbations. For the case of spontaneous emission the only change required in 
the above formula is E'y = Ej — hw. 


4.4 Ionization of hydrogen 


We aim to find the ionization rate for hydrogen when hit by the harmonically varying 
electric field of an electromagnetic wave. We assume the hydrogen atom has its electron 
on the ground state. In this ionization process a photon ejects the bound electron, which 
becomes free. 

Let us first do a few estimates to understand the validity of the approximations that will 
be required. If the electromagnetic field has frequency w the incident photons have energy 


Ey = hw. (4.4.1) 
The energy FE, and the magnitude k of the momentum of the ejected electron are given by 
h2 k2 


where the Rydberg R, is the magnitude of the energy of the ground state: 


2 ok 
2Ry=— =— 5 =a, Ry X136eV. (4.4.3) 
aj mas ao 


Inequalities 


1. Any electromagnetic wave has spatial dependence. We can ignore the spatial depen- 
dence of the wave if the wavelength » of the photon is much bigger than the Bohr 
radius ao: 

» 
—>il. (4.4.4) 
ag 
Such a condition puts an upper bound on the photon energy, since the more energetic 
photon the smaller its wavelength. To bound the energy we first write 
2x 2nc  2xhe 


ans iene ne re (4.4.5) 
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and then find 


Xr 2the 4n Ry Ry 
= = ~ 1722 —= 4.4.6 
ag hwag a hw hus ( ) 
The inequality (4.4.4) then gives 
hw <1722Ry ~ 23keV (4.4.7) 


2. For final states we would like to use free particle momentum eigenstates. This requires 
the ejected electron to be energetic enough not to be much effected by the Coulomb 
field. As a result, the photon to be energetic enough, and this constraint provides a 
lower bound for its energy. We have 


E,=Ey—Ry> Ry + hw > Ry. (4.4.8) 


The two inequalities above imply 
Ry < hw <1722 R,. (4.4.9) 

If consider that 1 < 10 we could take 
140eV < fw <2.3keV. (4.4.10) 


Note that even for the upper limit the electron is non relativistic: 2.3keV < mec? ~ 511keV. 


Exercise: Determine kag in terms of hw and R,. 


Solution: 1p? be 42 7 
= = = 1)= 1 4.4.11 
2m ae a (= ) 2maz (= ) ( ) 
hw hw 
k?a2 = — -1 kag = ,/—-—1. 4.4.12 
ao R, =? ao R, ( ) 


This is the desired result. In the range of (4.4.10) we have 


3 < kay < 13. (4.4.13) 


Calculating the matrix element. Our perturbation Hamiltonian is 
6H = -e®, (4.4.14) 


where ® is the electric scalar potential. Let the electric field at the atom be polarized in 
the 2 direction 
EK(t) = E(t)Z = 2Eo coswt Z (4.4.15) 
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The associated scalar potential is 


® = —E(t)z, (4.4.16) 
and therefore 
6H = eE(t)z = eE(t)r cos = 2 eEpr cos 6 coswt = 2H’ coswt (4.4.17) 
H’ 


We have thus identified, in our notation, 
H' = eEorcos@. (4.4.18) 


We want to compute the matrix element between an initial state |7) that is the ground state 
of hydrogen, and a final state |f) which is an electron plane wave with momentum k,. The 
associated wavefunctions are 


dis 2 
Final state: U; (x) = BR eur (4.4.19) 
i panies 
Initial state: wo(r) = eer (4.4.20) 
Ta9 


The only physical angle here is that between the ejected electron momentum k, and the 
electric field polarization E. We expect electron to be ejected maximally along E. This 
suggests rearranging the axes to have the electron momentum along the z axis and and 
letting @ be the angle between the electron momentum and the electric field. The integration 
variable r will have angles 6’, ¢’, and the angle between the electric field and r is now called 
6” as shown in the figure, so that H’ = eEgr cos 0”: 


(f| Hi) = fers —_ kT 6 For cos 6" e% 
| [3/2 703 
= Tan [rar sin 6’d6’ dq! et*" 8" r cos 6c a (4.4.21) 
Tas 
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The main complication with doing the integral is the factor cos 6”. The dot product of unit 
vectors along E and n is cos 6”: 


cos 6” = (sin @ cos ¢)(sin 6’ cos ¢’) + (sin @sin ¢)(sin 6’ sin ¢’) + cos 6 cos 6! 
= sin 6 sin &’ cos(¢ — ¢’) + cos 6 cos (4.4.22) 


Since the following integral vanishes 
[ew cos(¢ — ¢') = 0 (4.4.23) 


the first term in the expression for cos @” will vanish upon ¢’ integration. We therefore can 
replace cos 6” — cos @ cos @’ to find 


E ; / ease be 
(f|H'|4) = 0 | 3dr sin 6'd6' dg’ e*" °° cos 6 cos 6’. %0 
Jf rag L3 
eo 3 eee i ! /_ikr cos 0! 
(cos@)(27) [| r°dre % d(cos 6") cos #’e (4.4.24) 
et 


> / mae Ls 


Now let r = agu and do the radial integration: 


1 
(f\| Hi) = ee cos 0a d(cos 6’) cos’ | urdu evi t#ka0 cos 6) 
Jf rag L8 ee 
eEo 3! 


(4.4.25) 


1 
9 4 / / F 
ros ba f d(cos 4’) cos 6 (1 + ikag cos 6’)4 


7 / maeL3 


Writing x = cos 6’ the angular integral is not complicated: 


! 1 
(f|H’i) = 22 en cos bai Ze aL a dx 


2neE kag 
= 2 alae =o cos 6 (4.4.26) 


J ra,L (1 + k2a2)3 


Hence, with a little rearrangement 


_ kag 1 


Qe = 4 , 4.4.2 
(f|H'|2) 132,/m (eEpao) Jars (14 Pas cos 0 ( 7) 


The above matrix element has the units of (eLoa), which is energy. This is as expected. 


Squaring we find 
gk?aR cos? 


(4.4.28) 
[3 (1+ ka2)é 


|H',|?, = 1024 (eEoa) 
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At this point, with 6 the angle between k, and E, it is more convenient to align E along 
the z-axis and let ke be defined by the polar angles 0,¢. The density of states was given in 
(4.3.14) 


dX = amededp 
(4.4.29) 


so that using Fermi’s Golden rule, the rate dw to go into final states in the solid angle dQ 
is 


27 
dw = (Be) Hp? 


Qn L? m k?a> cos? 0 
— “ ~_ “dQ 10242(eE an)? -—2 : 4.4.30 
i 373 yard 0 1(e 040) EB (1 ye k2a2)6 ( ) 
It follows that 
dw 256 maz (eEoao)? kan 9 
= 6. 4.4.31 
dn «7 WR  &  (1+kagye ~* oe) 


oe is the probability of ionization per unit time and per unit solid angle. All the ejected 


electrons will have the same momentum hk. Note that the units have worked out: w ~ 
oo Hl ~ z as expected. In here 2Ep is the peak amplitude of the electric field in the 
wave. The cos? @ implies that the electron tends to be ejected in the direction of the electric 
field. 
The total ionization probability per unit time is obtained by integration over solid angle. 


Using 


[cos 6dQ = 5 Ar , (4.4.32) 


and recalling that a = 2R, 
0 
dw 512 (eEpag)? k3.az 
=.) AQ = : 4.4.33 
/ dQ 3. AR, (1+k2a2)6 ee 


For the window of validity (4.4.13) we have 9 < (kao)? < 169, and we neglect the “one” in 
the denominator to get 


5912 (eEpag)? 1 
— : 4.4.34 
” = “3° RR, _ (kao)? eae) 


This is our final answer. For numerical calculations it is useful to note the atomic units in 
which the answer takes the form 
_ 256 /E,p\21 1 
ve rg (a) ty (kag)? ’ 


(4.4.35) 
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Here E, = 2E is the peak amplitude of the electric field. Moreover, the atomic electric 
field E, and the atomic time t, are given by 


2R 
EB, = —%=5 = 5.14x10!'V/m 

ee nO (4.4.36) 
— a 2.42 x 107!" sec 

ac 


Note that FE, is the electric field of a proton at a distance ag while t, is the time it takes 
the electron to travel a distance ag at the velocity ac in the ground state. A laser intensity 
of 3.55 x 10'°W/cm? has an peak electric field of magnitude E,. 


4.5 Light and atoms 


4.5.1 Absorption and stimulated emission 


The physical problem we are trying to solve consists of a collection of atoms with two 
possible energy levels interacting with light at a temperature 7. We want to understand 
the processes of absorption and emission of radiation and how they can produce thermal 
equilibrium. 

Let us first consider a single atom and the possible processes. Consider two possible 
levels of an electron in an atom 


> G 


we Dona 


ee 4 ee eee 


Qn b 


Define 
Ey — Eq 


h 


Imagine now shining light into the atom at a frequency way. There are two possibilities 
depending on the initial state of the atom: 


Wab = 


(i) Electron initially in |a): there will be an absorption rate at which a photon is absorbed 
and the electron goes from |a) — |b). 


(ii) Electron initially in |b): there will be a stimulated emission rate at which the photon 
field stimulates an electronic transition |b) — |a) with the release of an additional 
photon. 
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These processes are illustrated in the figure below: 


bafow allen 


NYS 7 Alsovphor 
nv Are Stile len 
aaialt n>  gradeon 

—e—_——- 


A LASER, Light Amplification by Stimulated Emission of Radiation, works by having pop- 
ulation inversion, namely most of the atoms are in the excited energy level. Then any small 
number of photons can trigger larger and larger numbers of stimulated emission processes! 


\ etter te ~~ 
~~?) Ry! ae 
p 1 ty fd TE ees 


4.5.2 Ejinstein’s Argument 


; N 
Atoms in states a or b with Ey > Ey, with populations N, and = , Ey 
Np respectively. The atoms are in equilibrium with a bath of 
photons all at temperature 7. Einstein discovered a number 
of relations from the condition of equilibrium. a ® , Fa 
Fact 1: Equilibrium values for populations: 
Na =N,=0 (4.5.1) 


Fact 2: Equilibrium populations governed by thermal distribu- 
tion 


(4.5.2) 
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Fact 3: For a thermal blackbody, the energy U(w)dw per unit 
volume in the frequency range dw is known to be 


A udu 
Process Rate 
Absorption: |a) — |b) BabU (wea) Na 
photon absorbed 
Stimulated emission: |b) + |a) BoaU (wa) No 


photon released 


In the table above we have indicated the two obvious processes: absorption and stim- 
ulated emission. The rate indicated is the number of transitions per unit time. For ab- 
sorption, this rate is proportional to the number N, of atoms in the ground state, capable 
of absorbing a photon, times U(wgy) which captures the information about the number of 
photons available, times a B coefficient By, that we want to determine. For stimulated 
emission, the rate is proportional to the number Ny of atoms in the excited state, thus 
capable of emitting a photon, times U(w,,) times a B coefficient By,. The inclusion of 
U(wap) reflects the “stimulated” character of the transition. 

Can we make this work, that is, can we achieve equilibrium? We will see that we cannot. 
With the two processes above, the rate of change of Np is 


N, = rate absorption — rate stimulated emission (4.5.4) 
At equilibrium MN, = 0, hence 


0 = Ny = Bas (wWoa)Na — Boa (wea) No 
= (BapNa — BoaNo)U (woa) 


= Ny (Bos = Byers) U (wa) (4.5.5) 


This is a strange result: in order to have equilibrium we need Bay — Brae 22 = 0. The 
B coefficients, however, depend on the electronic configurations in the atom and not on 
the temperature T’. Thus this cancellation is not possible for arbitrary temperature. We 
also note that the photon energy density does not feature in the equilibrium condition. In 
conclusion, equilibrium is not possible. 

What are we missing? Another process: spontaneous emission!!, an emission rate that 
does not depend on the thermal photons. The rate is proportional to the number of atoms 
Ny, in the excited state, with a coefficient of proportionality called A: 
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Process Rate 


Spontaneous emission: |b) + |a) AN» 
photon released 


Reconsider now the equilibrium condition with this extra contribution to Nj: 
O= Ny = Bal (woa)Na — Boal (woa)N, — AN, 
No 


A 1 


Bab efhwra —_— Boa 
Bap 


Na 
=> A= (Basix? — Ba U (wba) 


= U (wba) = (4.5.6) 


The equilibrium condition indeed gives some important constraint on U(w,,). But from Eq. 
(4.5.3) we know 


hu 1 
Vie) = 23 5 (4.5.7) 
Hence comparing the two equations we get 
A fiw? 
Ba = B FC 4.5.8 
ab ba an Bas Foes ( ) 


As we’ll see, we'll be able to calculate By, but A is harder. Happily, thanks to (4.5.8), we 
can obtain A from Bap. 


Spontaneous emission does not care about U(w»q); the photons flying around. At a 
deep level one can think of spontaneous emission as stimulated emission due to vacuum 
fluctuation of the EM field. 


For a given atomic transition expect stimulated emission rate to dominate at higher 
temperature, the there are more photons around. Spontaneous emission should dominate 
at very low temperature. Indeed from (4.5.6) 
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4.5.3 Atom/light interaction 


Focus on E field. Effects of B are weaker by O (2) ~ a. For optical frequencies A ~ 
4000 — 8000 A and since ag ~ 0.5 A, we can ignore the spatial dependence of the field (even 
for ionization frequencies a ~ 1700) 


Electric field at the atom 


E(t) = E(t)n = 2E cos(wt)n 


where n is the direction of the field. 
®(r,t)=—r-E(t), V®=—-E (4.5.9) 
as expected. The perturbation is therefore 
dH =+q®(r) = —qr- E(t) (4.5.10) 
By defining the dipole moment d as d = qr, we can rewrite the perturbation as 
6H =—d- E(t) = —d- n2Eo cos(wt) = 2(—d- nEp) cos(wt) . (4.5.11) 
Since we defined 6H = 2H’ coswt we can read 
H’ =—-d-nEp. (4.5.12) 
Recall that to first order in perturbation theory P;.q(t) = P,»(t) so let’s consider just 


P,»(t), the stimulated emission process, from (4.3.46) 


= 4| Hi ,|? sin? (4-1) = 4EG|(d- n)qp| sin? (“'5-“t) 
h2 (wba — w)? hi? (Wa — w)? 


Paeo(t) (4.5.13) 


Now think in terms of energy. The energy density wz in the electric field E(t) = 2Eo cos(wt)n 
is 
JE)? _ 4EG 4E§ 1 _ &% 
ve 87 87 iia (ux) time 8x 2 An ( ) 
In a wave the electric and magnetic energy are the same and therefore 


EG 
20 


Ee = 2n(u) (4.5.15) 


(u) time =2 (ue) time 
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The superposition of light is incoherent, so we will add probabilities of transition due to 
each component of light. For this we must turn a sum of electric field intensities into an 
integral. 


> ER (wi) = 2 i isha (4.5.16) 
ud) a 


where U(w) is the energy density per unit frequency. In this way the transition rate in 
terms of energy reads 


_ \(d- n)a0)? / sin? (4-1) 
Poe) = ge ae U(w)dw “(wg — wy? 


(d= n) asl? i: sin? (“45-“t) 
— RO B68) (on)\U (wap) | deo 2 — 45.1 
h2 ( T) (w b) WW (Wha ae w)? ( 5 7) 
As usual, take x = (“s-“t) => dw = 2%, so that (4.5.17) is 
2 in? 
; ef nda = 2tn (4.5.18) 
re 
An? 2 
Ravd = zr l(a : N)ad| U (wpa) (4.5.19) 


So far we have two vectors: d which depends on the atom geometry and n which is the 
direction of the electric field. Since light comes in all possible polarization, this amounts to 
averaging over all possible directions of n 


(Ieao -ma!?) = (| 7 deamsl?) = (( 0 diame) (D> larry) ) = D2 (Gin)*(Eia) (rans) 


iJ 


(4.5.20) 
Expect 
(ngNz) = (Nyny) = (nzNz) (4.5.21) 
Since 
( S mini ) Syed (4.5.22) 
then : 
(ning) = 3 (4.5.23) 
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1 1 
(Ides |?) = 343, - don = 5ldao)” (4.5.24) 


This is the magnitude of a complex vector! 


An? 
= 5 ldas|?U (wa) (4.5.25) 


This is the transition probability per unit time for a single atom 


Ar? 2 
Bab = 3j2 | Gaol (4.5.26) 
thus recalling (4.5.8) 
hus? hue, 40? 2 
A= Toes Bab = aes apa Aa eed 


hence we have that A, the transition probability per unit time for an atom to go from 
|b) + |a), is 


3 
4 wh, 


we 3 he 


|day|* . (4.5.28) 


The lifetime for the state is T = 5. The number JN of particles decays exponentially in time 


dN 


ae ek SSS e~ At — Noe "7 (4.5.29) 
If there are various decay possibilities A;, Ao,..., the transition rates add 
1 
Atot. = Ar + Aot... and T= (4.5.30) 
Atot. 


4.5.4 Selection rules 


(to be covered in recitation) Given two states |n@m) and |n’é’m’), when (ném|r|n/l'm’) 4 0? 
One can learn that 


(ném|z|n'é'm’) = 0 for m' Am (4.5.31) 
(ném|a|n'é'm’) = (némly|n'e'm') = 0 form’ Am+1 (4.5.32) 

Hence no transition unless 
Am=m!'—m=0, +1 (4.5.33) 


Additionally 
Ag=l—fL=+H1 (4.5.34) 
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1 Introduction 


1.1 Adiabatic approximation in Classical Mechanics 


Let’s go back to classical mechanics, with a harmonic oscillator performing motion but with w(t) not 


constant. The hamiltonian of the system would be 


2 
SUE. de eae 
H(x,p,w(t)) = 2 + Smur(t)x (1.1) 
where x and p are going to be functions of time. 
In general 
dH OH, , OH OH (1.2) 
dt dc | dp at 
Hamilton’s equation of motion read 
OH OH 
pee ee ae 1.3 


Even if you are not familiar with these, you probably know well their quantum analogs: 


A) = (fe, Hr) = (in ; He) _ (Si 
in? — (ip, a) = (-inZ ) , dy) | (2 


dt Ox 
(1.4) 


At any rate, equations (1.3) imply that the first two terms in the right-hand side of (1.2) vanish and 
we have 


dH OH 
We have an adiabatic change if the time scale 7 for change is much greater than time scale of the 
oscillation, JT’ = ay" 
AWE) 


‘T % T= 20 
Le) 


What remains roughly constant as w(t) changes adiabatically? 


Claim: J(t) = —~ _ is almost constant in adiabatic changes of w. (1.6) 


I(t) is called the adiabatic invariant. 
Let us see why this is so: 


“ = =( H(t)i) = 5 (w (muwa”) _ (£ + jrn?(t)2?) i) = 
= 5 (gons2(e)o” r) = “ (V(t) — K(t)) , (1.7) 


where V and K are the potential and kinetic energies, respectively. We want to understand why this 
right-hand side is small. Clearly, = is slowly varying if w is slowly varying, but the term in parenthesis 
is actually quickly varying. Indeed, with 


x= Asinwt and p= Amwcoswt 


then the parenthesis goes like 


2 
dmuw? x? — Care + mw” (sin? wt — cos? wt) ~ —}mw? cos 2ut , 1.8 
2 in 2 2 


which is a fast variation. Let us see, however, how much I(t) changes over one period 


aT aT i+? Gy 
I(t+T) -I(t) = / Shy i) ” #) (V(t) — K(¢)) dt (1.9) 
t dt! t Ww? 
Since w(t) is slowly varying, it changes very little over a period T and we can take the factor w/w out 
of the integral: 
Ww t+T 
t+7)—1() = 2) / (V(t) — K(t)) at! (1.10) 
w t 
In harmonic motion with w constant, the average over a period of the potential energy V(t) is exactly 
equal to the average of the kinetic energy A(t), and the two terms above would cancel exactly. When 
w is slowly changing they roughly cancel 
w(t) 


It+T)—It)~— > 0-0 (1.11) 


What is the geometrical interpretation of the adiabatic invariant E/w? For this we consider the 
motion of a classical oscillator in phase space (x, p) where periodic oscillations trace the ellipse defined 
by energy conservation: 


2 
1 
— a 5m -E (w, E, constants) 


We quickly see that the semi-major and semi-minor axes have lengths 


2E 
a=\/—z and b=V2mE (1.12) 
mw 


E 
Area of the ellipse = tab = an (1.13) 


Therefore 


The adiabatic invariant is proportional to the area of the ellipse. That’s neat! We can rewrite the 
area as an integral that takes a familiar form: 


Area of the ellipse = ¢ pdx 


where we integrate over the whole closed trajectory with the orientation shown in the picture so that 
the area above the x axis is obtained as we go from left to right and the area under the x axis is 
obtained as we go from right to left. In this example 


f pas i ie (1.14) 


More generally, for other systems the left-hand side is an adiabatic invariant. The value of the invariant 
is different for different systems, for the oscillator it is 27E/w . 


1.2 Quantum mechanics systems 


What does it suggest for quantum mechanics? 
First, for the harmonic oscillator 


E hw(n+3) 
oe W 


=h(n+3) (1.15) 


Fixed z is fixed quantum number! This suggests that in QM the quantum number doesn’t tend to 
change under adiabatic changes. Indeed, we get the same intuition from WKB: consider a potential 
with two turning points a,b. We have the Bohr-Sommerfield quantization condition 


rewritten as f v(c)aa = 2th (n+4) (1.16) 


The left hand side, as noted before is an adiabatic invariant, so in the semiclassical approximation for 
arbitrary potential we find that the quantum number may not change under adiabatic changes! We 
will see that indeed that is the case. 


The formula for transitions at first order in perturbation theory gives complementary intuition: 


—~ 2 
Bs 2 Oe E 
Pr. a(t) = / cient SHE) oy (1.17) 
0 


For constant perturbations, oH fi is time-independent and 


it 
/ elit dt’ 
0 


e If the spectrum is discrete, it is hard to jump a big energy gap because of the a suppression. 
Fi 


para 2: 
| H;|? 
Prei(t) = = | 


2 7 |5H pil? eiwpil _] 


1.18) 
2 2 ( 
h We; 


e For slowly varying perturbations (compared to we;) the —r suppression will remain. 
fi 


So it is difficult in general to change state with constant or slow perturbations, suggesting again that 
quantum numbers are adiabatic invariants. That is why transitions to the continuum with constant 
perturbations essentially conserve energy. And why you need oscillatory perturbations for efficient 
transitions between energy levels. 


2 Adiabatic Evolution 
Suppose you have found a w(t) such that 


A(t)|o(t)) = E®|Y@) (2.1) 


we'll call |y(t)), defined by (2.1), an instantaneous eigenstate. I’d like to emphasize from the 
start that, in general, an instantaneous eigenstate is not a solution of the time dependent Schrédinger 
equation. As it turns out, it is a useful tool to construct approximate solutions to the Schrédinger 
equation. 

Let us try to understand the relation between |w(t)) and |W(t)), the solution to the Schrédinger 
equation 


ihd,|U(t)) = H(t)| V(t) . (2.2) 


Let us try an ansatz, building the solution |W(t)) using the instantaneous eigenstate: 
1 : / / 
Wo) = ees (Ff Ewha) woo), (2.3) 
with c(t) a function of time to be determined and where we have included a familiar time dependent 


phase. There is no guarantee this ansatz will work but it seems a good try. 
The LHS of the Schrédinger equation then looks like 


indy|¥(t) = e(Bex (5 [ 20ae’) I(t) + EO)W()) + c(texp (5 fe E(t tat) ji )) (24) 


For the RHS, using the instantaneous eigenstate equation, we have 


H(t)|W(t)) = dither (5 fe B(t that) (lb) = B()|W()). (2.5) 


Equating the two sides, we get 


é(t) exp (5 [ew ja) |e) + el Jexp (5, [ee ya) 4 t)) = 


and canceling the two exponentials we have 


e(t) |b(t)) = —e(t) |W(E) - (2.6) 


Multiply by (w(t)| we get a differential equation for c(t): 


et) = —e(t) (W(HIY(), (2.7) 


which happily we can solve. Letting c(t = 0) = 1 we can write 


= exn (- | ‘(we)id(e)yae) , (2.8) 


The above exponential is a phase because the bracket in the integrand is actually purely imaginary: 


y= favs = fac 5 ae fete 


(2.9) 
= [de ow — (bw) 
Since the wavefunction is normalized we have 
WOW) = —-@MWO) = -HOWO)* (2.10) 


showing that indeed (w(t)|y(t)) is purely imaginary. To emphasize this fact we write 


i) = ex (7 | “ioe HU) ae) (2.11) 


Having apparently solved for c(t) we now return to our ansatz (2.3), we get 


IW(t)) ~ (0) exp (« fae e)O)at ) exp (5, [Bw ya) Joe). (2.12) 


But there is a mistake in this analysis. We really did not solve the Schrédinger equation! That’s why 
we put a ~ instead of an equality. 

The equation we had to solve, (2.6), is a vector equation, and forming the inner product with (w(t)| 
gives a necessary condition for the solution, but not a sufficient one. We must check the equation form- 
ing the overlap with a full basis set of states. Indeed since |~(t)) is known, the equation can only have a 
solution if the two vectors |¢(t)) and |2)(t)) are parallel. This does not happen in general. So we really 
did not solve equation (2.6). The conclusion is that, ultimately, the ansatz in (2.3) is not good enough! 


We can see the complication more formally. At t = 0 equation (2.6) reads 


€(0)|4b(0)) = —e(0)|(0)) . (2.13) 


Using Gram-Schmidt we can construct an orthonormal basis 6 for the state space with the choice 
\W(0)) for the first basis vector: 


B={ 11) = |~(0)), |2), |3),--- iy (2.14) 


Equation (2.13) requires 

(n|~(0)) = 0 = 2s es (2.15) 
This will not hold in general. The key insight, however, is that (2.12) is a fairly accurate solution if 
the Hamiltonian is slowly varying. Making the definitions: 


if : / / 
a E(t), — v(t) 


so that #, vy and ¥ are all real, the state reads 


|W(t)) ~ (0) exp (i7(t)) exp(iO(t))|o(4)) . (2.17) 


(dO), ry) = [ venue (2.16) 


The phase 6(t) is called the dynamical phase and the phase y(t) is called the geometric phase. 


Comment: Instantaneous eigenstates are rather ambiguous. If one has the |w(t)), they can be modified 
by the multiplication by an arbitrary phase: 


bE) > WO) = e PO). (2.18) 


3. Systematic approach 


Must use a basis to see the error terms and get a more general picture. We will do this now sys- 
tematically. But first let us state precisely the idea in adiabatic approximation. Consider a family of 
instantaneous eigenstates: 


A(t)|dn(t)) = En(t)|Yn(t)) (3.1) 


with E\(t) < Eo(t) <..., so that there are no degeneracies. 
Adiabatic approximation: If at t = 0 |W(0)) = |yn(0)) for some n, then if H(t) is slowly varying for 
0<t<T then at time T we have |7)(T)) ~ |v,(T)) up to a calculable phase. 

Key facts: 


1. The probability to jump to another state |~,(t)) with k 4 n is highly suppressed. 


2. The phases can sometimes be relevant and have geometric meaning. An overall phase cannot be 
observed but if we have a superposition 


| (0)) = ci |41(0)) + e2|h2(0)) +... (3.2) 


the relative phases acquired by |w1(0)) and |22(0)) after time evolution can matter. 


Calculation: 


|U(t)) = So en()lvn(t)) (3.3) 
Schrédinger equation : 


MY? (Calbn(t)) + enldbalt))) = Yo ent) Bn()ldn(t)) (3.4) 


act with (a,(t)| from the left: 
they, = Exce — ih S~ Weln) en 
ihc, = (Fi = in( dele) ee) — ih ST (bel )en (3.5) 
n£k 
We can relate (w|qn) to a matrix element of H(t) in the space of instantaneous eigenvectors 
H(t)|Un) = En(t)!Yn) (3.6) 


take time derivative 


H(t)|Yn) + H(#)|%n) = En(#)|n) + En(t)|en) (3.7) 


multiplty by (w,(t)| with k An 


(de(t)|H (Yn) + x(t) (ve (t)ln) = En(t) (We (t) len) (3.8) 
hence : : 
-\ (belt) (t)lyn) _ Hen 
We can use plug (3.9) back in Eq.(3.5) to get 
ihc, = (4 = im(delx) cr) = ins” i B? (3.10) 
n£k ny 
Note that if the term proportional to ie vanishes, then |c,| = 1, hence if you start in |q,) you 
stay in |w,). 
If we ignore the extra term: 
Lae 
cx(t) = ex(O)ex | f° (Bute) — elven) a 
t t ; 
=ex(o)exn (5 ff Bx(tyat ) exp (if ivnlus)at) 
= ¢,(0) eM erm) (3.11) 
where 
1 t 
O(t) =—= | E,(t’)dt! 3.12 
= 5 f H(t (3.12) 
Uy (t) = ie (t)|Va(t)) (3.13) 
t 
or= eid 3.14 
vOe= f v(t) (3.14) 


4 lLandau-Zener transitions 


Take w(x; R) and q2(2; R) to be the electronic configurations of a molecule with fixed nuclei separated 
at a distance R. 
We have 


A(R)di(x; R) = Ei(R)yi(z; R) (4.1) 


E, (R) 


Depending on R the states are polar or non po- 
lar changing characteristic near R = Ro. If R 
changes slowly, ~, remains w or w2 will remain 
ye. But if R varies quickly, then yw, — we near 
Ro. This would be a non-adiabatic transition. 
Thinking of R as a function of time, we have a 
time dependent hamiltonian 


H(R(t)) Ro R&R 


with instantaneous energy eigenstates 


Wi(a; R(t) , E;(R(t)) 


we can now plot the energies for constant velocity R 


E E, (2) 


E,W) 


t% 4 


We can sometimes think of this as a 2-level problem; if no other eigenstates are relevant. 
So consider a slightly idealized 2 x 2 system 


H(t) = G =) , a>0 j1) = @ (spin up) 


Ex (t) = oe Eo(t) = re [2):= (7) (spin down) (4.2) 


_ ane (t) 
(s) N (2) 


The instantaneous eigenstates |1), |2) are actually time independent. Take 


id= (5) wad = (4) (4.3) 


\1) solves the Schrédinger equation (4.4) 


|2) solves the Schrédinger equation (4.5) 


Time evolution is clear: if |) = (6) at t = —oo it will remain in that state forever. Same for 


Is) = oF 


Now make the problem more interesting by perturbing the hamiltonian 
at 
= dts 
Hae) (4.6) 


with Hy. constant. 


At t = 0 the energy eigenvalues are +|Aj9| 


e Et) 3) 


10 


a2t 
Ex(t) rick etal (4 8) 
we can write H in a compact form by defining the vector 
at 
a= (112.0. *) =|ajn = Ex (t)n (4.9) 
taking Hy. to be real 
so that 
H=a-o (4.10) 


The instantaneous energy eigenstates are 


|tn(t)) with energies Ex(t) 


VRC) 
incady= (4) [-nedy = (6) 
(n(o)) = a ) \-ne)y = a 


The instantaneous energy eigenstates change type 


0 1 
(1) > (5) (polar - non polar) 
@ > (7) (non polar - polar) 


|Hi9| 


An adiabatic process will keep you there. Consider 7g for the duration of the change la) = Ta > since 
t t 
ne S S| = ‘elt (see fig) (4.11) 
Also [ial = wy and it is called Rabi frequency. It’s relevant for the transitions at t = 0 since 
0 Aye 
H(0)= : 
On. 
Process is adiabatic when , 
———— H 
W12Ta > 1) ==> a > 1 (4.12) 


11 


1.e. 


2 
ee ee (4.13) 
W112 


hence, the adiabatic change is much slower than the natural time scale of the system. 
Alternatively: The Hamiltonian should not vary much over the time scale T associated with the relevant 
energy difference AE 


dH 
| H 4.14 
rel < (4.14 
Approximate T = z if AE ~ HA 
h dH d {1 h 
a haaly ats 4.1 
ma|<1 > |g (s)/<1 or a (3) (4.15) 
analog of la. (4)| < 1in WKB. 
For our case 
g <1 (4.16) 


a J |Aie|? + at 


Let’s check! | Ett) 


2 
d h = hart — ha at 


3/2 3/2 
@ line +2] | (e+ 22) (|i? + S*) 


VE) 
t ha|H h 
< max ha = 372 ~) 4 3 = oP <1lv 
242 
(|Hisl? + ot ) 12 12 7 


(4.17) Max of (a2 be2)372 


happens for 2? = = 


Probability for a non-adiabatic transition 


H 2 
Prad. = exp (—2mW12T.) = exp (-20! 121 ) (4.18) 


5 Berry’s phase 


Adiabatic theorem: |y~(t = 0)) = |wn(0)) and with instantaneous eigenstates 
|Wn(t)) we find 


W(t) x eM et O lap, (¢)) (5.1) 
with, as in (3.12) 
1 f / / 
a(t) = —5 / Pal Jat (5.2) 
Un (t) = in (t)|Yn(t)) (5.3) 
oi i va(t!dt (5.4) 


We now understand the relevance of 7,(t), the geometrical phase: 
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Assume H depends on a set of coordinates 


R(t) = (Ri(t), Ro(t),..., Rn(t)) (5.5) | aie 
several parameters that are tune dependent. - 
Instantaneous states ra 4 Q 2 
R30) 
A(R)|bn(R)) = E(R)|n(R)) (5.6) Ki 


Evaluate the geometric phase. Start by computing 


a(t) =7 Yn(R(E))]<fdn(R())) (6.7) 
we need + |un(R(t))) 
N 5 
fila) = > apg, PaO) Fe = Pen) TP (5.8) 
so that ae 
a(t) =i VaR) Priva) BO (65.9) 
and 
int) =f" va(dde= [i vn(RO)|Falin(Re)) Sar (5.10) 
hence the geometrical phase 7,(tf), also known as Berry phase, is 
Ry 4 
intts) = ff "4 Ya(R)|Frx|vn(R))- AR (5.11) 


The integral depends on the path, but does not depend on time!! Whether you take a nano second or 
a year to make transition, the geometric phase is the same! 


i dn(R)|Vrivn(R)) is an N—component object that lives in the parameter space. It is called the 
Berry connection A,(R), associated with |y,,(t)) 


A,(R) = 7 Un(R)|Vr|dn(R)) (5.12) 


In this way we can rewrite the Berry phase as 
Rf 
Ae A,(R)-dR (5.13) 
Ri 


If we redefine the instantaneous states by an overall phase 


|Un(R)) + [dn(R)) = ey, (R)) (5.14) 
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where 6(R) is an arbitrary real function, what happens to A,,(R)? 
An(R) =i Un(R)|Ve|Yn(R)) = 


=—j Yn(R) |e PV Re PF ®) |y,(R)) 


=i (-?¥R8(R)) (bn(R)lvn(R)) + An(R) (5.15) 
1 
A,(R) = An(R) + VrA(R) (5.16) 


analogous to the vector potential in EM (A! + A’+ VE ). 
What about the Berry phase? 


Ry Ry 
F(R) =f" An(R)-AR=W(Ry) +f" FRS(R)-aR (5.17) 


In(Re) = (Ry) + BCR) — (Ri) (5.18) 


The geometrical phase is completely well defined for closed paths in parameter space. 
Comments: 


e If the ~,,(t) are real, then the Berry phase vanishes: 
% d 
Um = Vn (t)|Yn(t)) = tf ax alad Gelb) 
d. d. : 
=i f dxvn(t29 5 Oa) ee 
1 

= 55, | exlvn(t29? = (5.19) 

since the wavefunction is normalized. This should not be surprising. In general grounds we 


have seen that (q,(t)|Wn(t)) is purely imaginary. But if wp is real this overlap cannot produce 
a complex number, so it can only be zero. 


e If there is just one coordinate, i.e. R = R, Berry’s phase vanishes for a loop 
$ A,(R)dR=0 —1~—>=+— (5.20) 


In here the integral from R; to Ry is cancelled by the integral back from Ry to Rj. 


e In 3D we can make use of Stoke’s theorem to simplify the calculation of 7,[C] 


= f An) ar = [f/f (Vv x A,(R)) ad= ff Dy a5 (5.21) 


where S is the surface with boundary C' and we defined the Berry curvature Dy, 


D, = V x A,(R) (5.22) 
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If we think of the Berry connection as the vector potential, the Berry curvature is the associated 
magnetic field. Note that the curvature is invariant under phase redefinition 

Dn + Di, = Vx A,(R) =V¥ x (An(R) + ¥A(R)) oy 
= VxA,(R)+Vx<¥6(R) = V x A,(R) = Dn. 


Example: Berry phase for an electron in a slowly varying magnetic field. Take a magnetic 
field with fixed intensity and slowly varying direction B(t) = Brit), put an electron inside this 
magnetic field, so that we get an interaction described by the hamiltonian 


H(t) = —p- B(t) = we Bi(t)-é (5.24) 


The direction of the B field, 7(t) swipes a closed loop C on the 


Z 5 ‘ : 5 h 
surface of an imaginary sphere of radius p in a time scale T > = zB: 


Think of the instantaneous eigenstates lx (R(t))) satisfying 


H(t)|x+(R(t))) = +ueBlx+(R(t))) (5.25) 
where 
R(t) = (r(t) , A(t), d(t)) (5.26) 
7 
=p (fixed) 
We get 


= 1 ~ = 1 1 Q 
= ra = Di. = —_ fp. fr Q= z= Q= 7 — 2 
Ds F5,a? +[C] II. +°dS =/f arte d +5 ik a5 5 (5:27) 


6 Molecules and Born-Oppenheimer 


Molecules are much harder to treat than atoms. Atoms are hard because even though the potential 
created by the nucleus is spherically symmetric, the Coulomb interactions between the electrons breaks 
the spherical symmetry. In molecules even ignoring Coulomb repulsion between electrons, the potential 
created by nuclei that are spatially separated is not spherically symmetric. 

In some approximation one can view a molecule as a system in which nuclei in classical equilibrium 
with well localized positions while the electrons move around in the Coulomb potential created by the 
nuclei. This approximation is reasonable since typically 77 ~ 10-4 where m is the electron mass and 
M the nuclear one. In this picture slow nuclear vibrations adiabatically deform the electronic states. 


In order to make estimates, consider a molecule of size a, so that 


h h? 
Pelectron ™ and Electron ~ aa) (6.1) 
a ma 
The positively charged nuclei repell each other but the electrons in between create an effective attrac- 
tion that, at equilibrium, cancels the repulsive forces. There will be nuclear vibrations around the 
equilibrium configuration. Consider the nuclear oscillations governed by the nuclear Hamiltonian 
P?2 


Hy = —+2k2’? 2 
N aut 2 he (6.2) 
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The restoring force is determined by k, and it is due to the electron system, with no reference to the 
mass M. Since & has units of energy over length-square, we must have 


_ Energy h? 


k ip aes (6.3) 
But k = Mw”, with w the frequency of nuclear oscillations so we have 
h mh? m_ he 
Mis? ws 2 hw ~ ,/— ——. 6.4 
* Rae oe M m?a* M ma? 64) 


We thus find that the nuclear vibrational energies are 


m 
vibration ~ 4/ 7g Lelectron : (6.5) 


There are also rotations and their energy Eo; is even smaller, as they involve essentially no distortion 


of the molecule 


I? Wee+1) mA 


EB = = ~N ———— 
nO SOF Ma? M ma? 


m 
rotation = Ty Eelectron (6.6) 


Therefore we have the following hierarchy of energies 


m m 
Electron : Evibration : Exotation = is \/ iva : M (6.7) 


6.1 Born-Oppenheimer approximation 


Consider a molecule with N nuclei and n electrons. The Hamiltonian takes the form 


Pe ” p? 
H= > + Vvn(R og Ven (R, rt) + Vee(r), (6.8) 
where My with a=1,...,.N are the nuclear masses and 
P,, Ra: nuclei canonical variables Vnn : nuclei-nuclei interaction 
Pi, r; : electron canonical variables Vee: electron-electron interaction 
R = (R,,..., Rn) Ven : electron-nuclei interaction 


r= (rj,.--,In) 


The wavefunction w(R,r) for the molecule is, in position space, a function of all the nuclear positions 
and all the electron positions. 

In the limit when M,,/m — oo the nuclear skeleton may be considered fixed, making the positions 
R are fixed. The electrons then move under the effect of the nuclear potential V.y(R,r) with fixed R, 
and the electron-electron Coulomb repulsion. The relevant Hamiltonian H, for the electrons is then 


Help.rsR) = 7 PE 4 Ven(Ryx) + Veelt). (6.9) 


This is a different Hamiltonian each time we change the positions R of the nuclei. The associated 
Schrodinger equation for the electrons is 


2 7 . 
-~ 32 V2, + Ven (Bir) + Vee(r)| of’ (r) = BO (ROR (r). (6.10) 
41 


The wavefunction for the electrons, as expected, is a function of the position of the electrons r, which 
appears as the argument of the wavefunction. Since the wavefunction depends on the nuclear positions, 
this dependence is included as a subscript. Finally, the superscript 7 labels the various wavefunctions 
that may appear as solutions of this equation. The associated energies BEO(R) depend on the nuclear 
positions and the label 7. If we calculated all the oY (r) we would have a full basis of electronic 


configurations and we could write an ansatz for the full wavefunction of the molecule: 


HBr) = o(R)de(r), (6.11) 


where the 7 are the unknowns. Substitution into the full Schrédinger equation 
Ay(R,r) = Ev(R,r), (6.12) 


gives an infinite set of coupled differential equations for 7 (R). This is too difficult, so we try to 
make do with a single term: 
v(R, r) = n(R)or(r) ’ (6.13) 


where we would generally use for dg(r) the ground state wavefunction for the electrons in the frozen 
nuclear potential. If we have this wavefunction for all R, as we now assume, we also know the value 
E-(R) of the associated energy. 


We will do a variational analysis. For this we will compute the expectation value (w~|H|w) using 
v(R,r) = 7(R)dr(r). We will then utilize the known ¢R(r) to integrate the electronic dependence in 
the expectation value 


(H) = / dRar U*(R,r)H U(R,x) = i dRar 1*(R)oR(r)H (R)dn() 
(6.14) 
= / dR 7*(R) Hog n(R), 


thus discovering the effective Hamiltonian Heg¢ for the nuclear degrees of freedom. We begin the 
calculation by rewriting the original Hamiltonian: 


H=Hy+He, Hy= > —*% 4+Vyy(R). (6.15) 
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As a warmup we calculate the expectation value of H.: 
(H.) = f dR ary" (RoR (0) Heron 


7 / dR"(R) | / dr 6 (t)Hedr(r)| n(R) 
aay 


= f ak R)ER)n(R), (6.16) 


This term has contributed to the effective nuclear Hamiltonian the value of the R dependent electron 
energy. Now the nuclear term 


2 
(se) 7 ia / dR dr n* (R)O%(r)PaPa(n(R)or(r)) 


1 
= sz | Rater RoR (Pa [(PaiB)) nls) +0(R)(Padatr))] (6.17) 
We want to move ¢p(r) closer to the other ¢R(r) to be able to do the integrals. For that we use 
op (t)Pa = Padr(r) — (Padr(r)), (6.18) 


where in the last term, as indicated by the parenthesis, the derivation is acting only on the field dR (r). 
Hence 


2 
(Re) = spe [dBc a(R) {Po [oa(0)(PaR))on(0) + di(0)n(R)(Pad(o) 
— (Pada(r)) |(Pan(R)) én(x) + n(R)(Podn(r))] $ (6.19) 


We can now move in the dr integral 


(Fe —— i: dRy"(R) {P. [Pan(R) | dr d&(r)or(r) + n(R) / dr d(t)Poon(r)| 


— (Pan(R)) je (Padp(r)) oR) +R) [ dr (Padie(t)) (Poor tt) | (6.20) 


It is now convenient to define 


A(R) = in f ae or(r)Vr, or(r) = - [a or(r)Podr(r) (6.21) 


This is a Berry connection! We have a full Berry connection for each nucleus (thus the label a). The 
Berry connection arises from the electronic configuration. With this definition we have 


2 
(se) = saz f aR (R) { P2n(R) ~ Pan(R)An(R) ~ Ao(R)(Pon(R)) + 1?n(R) f dr |Ve, on(r)|"} 


= 2 2 2 
= f amet) {Par Ael — Aa 4 I [arom onl} n(®) (6.22) 
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Hence 
(x Oe Dake t Dan, [a lvecenl| 008) (o29 


so that the total expression reads 


| = f arn) 


PA 
2M, 


(Pa — Aa) Boh 
~ je (R) d, ou + Wwn(R) + B(R)-) e+) ag je |Vr.r| | (R) 
(6.24) 
and separating (H) in kinematics and potential term 
= * (Pa = Aa)” 
(H) = f ana (R) ai + CR)] ntR) (6.25) 
we can read from (6.24) the effective potential U(R) 
y hi? 2 Ad (R) 
U(R) = Viw(R) + BAR) + oyz fa |Vaaon| “2 OM, (6.26) 
Since . ; 
Py she Vee nese (vr, x As] (6.27) 
i i h 
the Schrédinger equation for nuclear motion is 
h? he a 
ar. (Ve, — FAs) +0(R)] n(R) = Bu(R) (628) 


Berry’s phases are not there if g(r) can be chosen to be real (note that we still have the if dr \VRra or] 
term ). Lowest order Born-Oppenheimer: 


h2 
H=-\- OM, VR + Vyn(R) + E.(R) (6.29) 


a 


6.2 Following the direct route 


H=Hy +H. =Ty + Vnn + He (6.30) 
where 
x h2 2 
y= d. aM. VR. (6.31) 
0.6 = B(R)\Og® 6.32 
cOR e(R)\ dR (6.32) 


and 


v= >> RoR). (6.33) 
Therefore the eigenvalue equation Hw = Ew looks like 
Hb => Twn (Roy (e) + Dn (R) “ + He) dy (r) 


= Liv + Vow + BOR CRI) — OD ay,  [2¢Van(R)) (Tse, ) +n! (R)VR, 0 
SE ee) (6.34) 


We now multiply by f dr gel (r) 


% i. : : : h2 ie 
[By + Pw + ER] nO = B/S [2(TR.0) far oP OVR.LE) +0 f dro VR.) 


(6.35) 


” q” 


Suppose there was no sum over ”7”, so that j =i 


= Vr. Ga / uaa — (Vn) = Aa(R) — 9 / drVr, og) (0)VR. 8 (2) = 


1 ‘ 
= 5 Fa G =A) — (Vr0 =A, (R) — 0! / dr|Vr,,¢2 (x) (6.36) 
Back 1 
S> sag Pe (n'Aa) — AgPan® ip —7®) i dr|Vr.o (0)| (6.37) 
would get perfect agreement!! 
6.3 The Hz ion 
The electron Hamiltonian is 
2 C - 
_~_ o_ef1,1 : 2 
HA. = raed e€ (= ~) _ ae — (6.38) 
=—aTS&C&“UM@{C’T-—"“ _ R == 
fe Vans ? Pp 
Do a variational ansatz 
1 oe 
= Alvo(ri) + Yo(r2)] Yor) = j= ge * (6.39) 
Impose normalization of the wavefunction 
1 R Ro. TfRy* 
oa Al? ith I=ea |1 as 6.40 
|| | A| +H wit € Te a ceeael (6.40) 
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Since (H.) ~ E.(R) the full potential is 


E.(R) + Vwn(R) (6.41) 
—S 


cad 
R 


Experimentally we have R = 1.06A and E = —2.8 eV. 
Ansatz is not very accurate, for small R wavefunction becomes ground state of Hydrogen atom not 
ground state of Helium, as it should. 
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Chapter 7 


Scattering 


© B. Zwiebach 


In high energy physics experiments a beam of particles hits a target composed of par- 
ticles. By detecting the by-products one aims to study the interactions that occur during 


the collision. 


— (i >0D— = dahkatan 
boar onggh ‘ oO. 


Collisions can be rather intricate. For example, the particles involved may be not 
elementary (protons) or they may be elementary (electrons and positrons) 


p+p > pt+p+7 
Prp > pP+tn+T 
ether =k ara 
The final products may not be the same as the initial particles. Some particles may be 
created. 
Here collisions are like reactions in which 


a+b > etdt+e+... 


We have scattering when the particles in the initial and final state are the same 


a+b—> a+b 
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The scattering is elastic if none of the particles internal states change in the collision!. We 
will focus on elastic scattering of particles without spin in the nonrelativistic approximation. 


We will also assume that the interaction potential is translational invariant, that is, 
V(ri —Yr2). It follows that in the CM frame the problem reduces, as we did for hydrogen 


atom, to scattering of a single particle of reduced mass off a potential V(r). We will work 
with energy eigenstates and we will not attempt to justify steps using wave-packets. 


7.1 The Schrodinger equation for elastic scattering 


We are interested in energy eigenstates 


(7.1.1) 
(7.1.2) 
[-# ov +V (7.1.3) 


Figure 7.1: Potential is finite range, or vanishes faster than+ ,, as rT — 00 


Will be consider solutions with De energy. The energy is the energy of the particle 
far away from the potential, F = . The Schrédinger equation then takes the form 


—~—— (V? +k?) +V(r)| v(x) =0 (7.1.4) 


Now we must set up the waves! Recall the 1D case. Physics dictates the existence of 
three waves: an incoming one, a reflected one, and a transmitted one. We can think of 
the reflected and transmitted waves as the scattered wave, the waves produced given the 
incoming wave. Setting up the waves is necessary in order to eventually solve the problem 
by looking into the details in the region where the potential is non-zero. When the potential 
has finite range the incoming and scattered waves are simple plane waves that are easily 


'Frank-Hertz experiment involved inelastic collisions of electrons with mercury atoms in which the mer- 
cury atoms are excited. 
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written. 


Equation (7.1.4) has an inifinite degeneracy of energy eigenstates. When V (r) = 0, for 
example, e’* for any k such that k-k = k? is a solution. Assume an incident wave moving 
towards +2. Then the wavefunction will look like 


g(r) =e (7.1.5) 


If we assume V (r) has a finite range a, this y(r) satisfies (7.1.4) for any r > a. For r < a, 
however, it does not satisfy the equation; y(r) is a solution everywhere only if the potential 
vanishes. 

Given an incident wave we will also have a scattered wave. Could it be an w(r) = e 
that propagates radially out? 


ikr 


(V?+k*)e*" 40 fails badly for r 4 Ol! (7.1.6) 

On the other hand * 
(V2+k2)——=0  forr £0. (7.1.7) 

r 


This is consistent with the radial equation having a solution u(r) = e’"* in the region where 
the potential vanishes. Recall that the full radial solution takes the form u(r)/r. 
Can the scattered wave therefore be as ? Yes, but this is not general enough. We need to 
introduce some angular dependence. Hence our ansatz for the scattered wave is 


eikr 


Ws(t) = fe(9,¢) (7.1.8) 


r 


We expect the intensity of the scattered wave to depend on direction and the function 
fx (0,0) does that. We will see that ~, is only a solution for r > a, arbitrarily far. 
Both the incident and the scattered wave must be present, hence the wavefunction is 


tkr 
V(r) = vole) + ole) =e + fal8,4)—, >a. (7.1.9) 
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As indicated this expression is only true far away from the scattering center. We physically 
expect f;,(6,¢) to be determined by V(r). f,(0,¢) is called the scattering amplitude. 


We now relate f;,(0,@) to cross section!! 


# particles scattered per unit time 
d into solid angle dQ about (6, ¢) 
Oo = 


it particles (7.1.10) 


[Aux of incident particles =7 ae 


do is called differential cross section, it’s the area that removes from the incident beam the 
particles to be scattered into the solid angle dQ. Let us calculate the numerator and the 
denominator. First the denominator, which is the probability current: 
hk . 

z 


m 


(7.1.11) 


h 

sae = _ im [-* Vi] 

Intuitively, this can be calculated by multiplying the probability density |e’**|? = 1, by the 
— hk 


velocity 2 = ie . The result is again an incident flux equal in magnitude to AR 


To calculate the numerator we first find the number of particles in the little volume of 
thickness dr and area r7dQ 


ar 
dn = number of particles in this little volume v 
AD 
etkr 2 
dn = |b(r)/? dr = | fx(8,6)——] r?dQdr = | f(8, 9)|? dQ dr (7.1.12) 


With velocity v = ak all these particles in the little volume will go cross out in time dt = dr 
therefore the number of particles per unit time reads 


dn >dQdr hk i 
& ~ Wed)? EX = Lil, 8)? a0 


Back in the formula for the cross section we get 
HE) £0, 0) dO 


a 


do = 


hence 


Differential cross section: < = | f,(0, 0) (7.1.13) 


Total cross section: |o = [eo = i \fe(0, 6)|\? dQ (7.1.14) 
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7.2 Phase shifts 


Assume V(r) = V(r), so that we are dealing with a central potential. First recall the 
description of a free particle in spherical coordinates. With 


h2k? 
va0, B=8E yey = Oy, (7.2.1) 


the Schrédinger equation for ugg 


he 2 p2 (€+1) c= R2k2 (7 
Qndr2. 2m_——sr? 2 aa em aC 
d2 &(€+1) 3 
( ae) 2 ) ues) = k*upe(r). (7.2.2) 
Now take p = kr, then (7.2.2) reads 
dd? é(é+1 
2 2 upe(p) = Uee(p) (7.2.3) 
dp p 


Since k? disappeared from the equation, the energy is not quantized. The solution to (7.2.3) 
is 


upe(p) = Acpje(p) + Bepne(p) or uge(r) = Aerje(kr) + Ber ne(kr) (7.2.4) 
where 

je(p) is the spherical Bessel function jo(p) is non singular at the origin 

n(p) is the spherical Neumann function n(p) is singular at the origin 


Both have finite limits as p > oo 


PJje(p) > sin (+ = =) (7.2.5) 
pne(p) + —cos (+ — 3) (7.2.6) 


Now, with the plane wave a solution, we must have 


elke — cikrcos? _ SS agP¢(cos 0) je(kr) (7.2.7) 
=0 


for some coefficients ag. Using 


2041 ia ke 
Y20(0) = = Pi(cos@) and u(t) = 5 | e™" Pr(u)du (7.2.8) 
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it can be shown that 


elke — Ag s V20 414° Yoo(O)je(kr) . (7.2.9) 


£=0 


This is an incredible relation in which a plane wave is built by a linear superposition of 
spherical waves with all possible values of angular momentum! Each ¢ contribution is a 
partial wave. Each partial wave is an exact solution when V = 0. 

We can see the spherical ingoing and outgoing waves in each partial wave by expanding 
(7.2.9) for large r: 


br 1 [eilr-S) — e-iler-F) 
je(kr) 4 — sin G . ) Sik | ; 7 ( 0) 
Thus 
' J4r 1 p etler-F) i(kr— 4) 
elke — => 241i Yeo(8) 5 s - 5 - FSG (7.2.11) 
£=0 
outgoing incoming 


7.2.1 Calculating the scattering amplitude in terms of phase shifts 


Recall 1D case 


1 F ; 
4 y(t) =sin(ka) = s (e* ae ) solution if V = 0 
=< ? : ingoing 
OO x 
(7.2.12) 
Exact solution ‘ 
oz) = 5 (ettete — e tke ) for 2 >a (7.2.13) 
i “YH 


_ same 
ingoing 
wave 


where the outgoing wave can only differ from the ingoing one by a phase, so that probability 
is conserved. Finally we defined 


(@) = os(x) + v(@) (7.2.14) 
So do asimilar transformation to write a consistent ansatz for ~)(r). We have from (7.1.9) 


ikr 
wr) 2 elk 4 fe(@)— , ra. (7.2.15) 
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The incoming partial waves in the left-hand side must be equal to the incoming partial 
waves in e’** since the scattered wave is outgoing. Introducing the phase shifts on the 


outgoing waves of the left-hand side we get 


_ Vin oo 1 Blo le id, eo i(kr—-F) " etkr 
ae > + 1% Y; E = elke 
7 J22+ 1% 2,0(9) = : - + fr(9) z 
l= ee ee 
outgoing incoming 


The incoming partial waves in e““* cancel in between the two sides of the equation, and 


moving the outgoing partial waves in e“** into the other side we get 
- iln . 
An oO : i + etkr e— etkr 
YAY V2EF Li Ve 0(6) 5 (7 = 1) = f,(0 7.2.16 
FL VE FTE YAO) 5; (¢ = (0) (7.2.16) 
= ee Se 


e?e sin 5p 


Var = 
= “= Ss" V2 +1Y20(0)e% sin d¢ 
¢=0 


ikr 


(7.2.17) 


where we noted that e~'2 = (—i)* and i£(—i) = 1. Therefore we get 


fi. (8) = “ Ss" V2l2+1 Yoo(0)e% sin dg . (7.2.18) 


This is our desired expression for the scattering amplitude in terms of phase shifts. 
We had 
do = | fx (0)|?dQ (7.2.19) 


and this differential cross section exhibits 6 dependence. On the other hand for the full 
cross section the angular dependence, which is integrated over, must vanish 


aos Paa= f ROH 


= a * J 20 +1720 + Le“ sin dye" sin dy i dQ ¥/'9(Q)¥er,0(2) 


Le! SS 
Ogg 


Hence 


_ An 


Ga a5 S "(26+ 1) sin? db: . (7.2.20) 
l= 


Now let us explore the form of f(@) in the forward direction 0 = 0. Given that 


2€+1 2@+1 
Ye,0(9) = 4/ re Pr(cos0) = > Yeo(@=0) = ae 
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then ee i 
fr(O = i ee vai y/ tt e¢ sin by = zt 20 + 1)e% sin dy 
l=0 l=0 
Hence 
i 1 k? 
= yo (2€ + 1)e* sin? 5, = Eine (7.2.21) 
l=0 


Therefore we have found out a remarkable relation between the total elastic cross section 
and the imaginary part of the forward (@ = 0) scattering amplitude. This relation is known 
as the Optical theorem and reads 


4 
v= = In(F(0)) : Optical Theorem 


Let us consider ways in which we can identify the phase shifts 6,. Consider a solution, 
restricted to a fixed 


x)), = (Agje(kr) + Bene(kr)) Yeo(0) x >a (7.2.22) 


If B40 then V £0. As a matter of fact, if V = 0 the solution should be valid everywhere 
and ng is singular at the origin, thus By = 0. 
Now, let’s expand ¢(x)|¢ for large kr as in (7.2.10) 


Ag é br Be lr 
W(x) = F sin G — =) — |, €08 G — =) Y¥¢.0(0) (7:2.23) 
Define B 
tan dg = =i (7.2.24) 


We must now confirm that this agrees with the postulated definition as a relative phase 
between outgoing and ingoing spherical waves. We thus calculate 


oF sin G — 2 + x) Y¢.0(0) (7.2.25) 


au ip Lt i(kr-426~) _ ,-i(kr-&) 
kr Bi [e ° e 2 | ool) (7.2.26) 
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This shows that our definition of 6; above is indeed consistent the expansion in (7.2.16). 
Finally we can read the phase shift from (7.2.25) 


1 bn 
(x)| ~ —sin | kr — — + d¢ } Ye0(0) r>a (7.221) 
¢ kr 2 , 
in the partial wave solution. 
As an aside note that both the outgoing and ingoing components of a partial wave are 
separately acceptable asymptotic solutions obtained as: 


e If Ag = —iBy 
. en 
- en lm] Yoo(0) — et(4r-‘F) 
its prey teice k , ~ Yp.0(0 7.2.28 
v()|, f sin G ; ) + cos ( ae )] kr a 2,0(9)  ( ) 
e If Ag =iBe 
oil) 
| pee eas, Fi) (7.2.29) 
v(x) £ kr an 
Each wave is a solution, but not a scattering one. 
7.2.2 Example: hard sphere 
< 
V(r) = 1 ene (7.2.30) 
0 r>a 
Radial solution _ 
Re(r) = — = Acje(kr) + Bene(kr) (7.2.31) 
(a, 0) = S~ Re(a)Pe(cos@) = 0 (7.2.32) 
é 
The P(cos@) are complete, meaning that 
0 = Re(a) = Agje(ka) + Beng(ka) VL (7.2:33) 
Recalling 
Be _ je(ka) 
tan dg A ake ( ) 
this implies all phase shifts are determined 
jo(ka) 
= 7.2.35 
tan dg ee ( ) 
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We can now easily compute the cross section o, which, recalling (7.2.20) is proportional to 


sin? Og 
2 

_2- _ tan“ dg 

sin* 6g = 


Jt (ka) 


hence 


Ar = : Ar = 
a= Fa (+1) sin’ be = 75 (20+ Ve 
£=0 


£=0 


Griffiths gives you the low energy ka < 1 expansion 


Oo 


are 1 Of 
ee fap 2+ 1 1 (28) 


At low energy the dominant contribution is from ¢ = 0 


1+ tan? 5; je(ka) + n3(ka) 


4 
] (kayt*? 


An 
Repent Dis 2 Full area of the sphere! 
o.= k2 (ka) = 4a Not the cross section! 
One more calculation! 
‘tan é 1D gO = eM | 9i6 _ 1t+itand 
itand = — ——— => ee = —____ 
ed + etd e215 4.1 1—itand 
Therefore i 
» Je\ka Ys Tyas : 
e2ide — 1+ 05 a) nti . ig—in) 


ne(ka) 


je(ka) — ine(ka) 
je(ka) + ing(ka) ’ 


ride — 


yj oekk) ~~ n— aj —i(j +in) 


Hard sphere 
phase shifts 


7.2.3. General computation of the phase shift 


(7.2.36) 


(7.2.37) 


(7.2.38) 


(7.2.39) 


(7.2.40) 


(7.2.41) 


(7.2.42) 


Suppose you have a radial solution Re(r) known for r < a (V(r > a) = 0). This must be 
matched to the general solution that holds for V = 0, as it holds for r > a: 


Hw 


At r=a must match the function and its derivative 


__ Ay lg lb) + Bo (ry (er) 


Re(a) = Agje(ka) + Bene(ka) 


(7.2.43) 
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aR)(a) = ka (Acit(ka) + Beni(ka)) (7.2.44) 
Let’s form the ratio and define / as the logarithmic derivative of the radial solution 
aR;(a) Agje(ka) + Bony (ka) je(ka) + Fn} (ka) je(ka) — tan denp(ka) 
b= = ka— = ka a = ka 
Re(a) Agje(ka) + Bene(ka) je(ka) + ai ne(ka) je(ka) — tan dgng(ka) 
(7.2.45) 
In principle we can use eq. (7.2.45) to get tan de, but let’s push the calculation further to 


extract the phase shift from e?"°¢ 


jecos de — mpsindg _ k Je (e% + ee) + in) (e% = ee) 


= a : 
Be je cos Og — ne sin dg je (e%e + e- te) + ing (ete — ete) 
= eg Set inh) Fee) _, Get tm) + eH) 9 ap) 
ee(jp + ing) + ee (jp — ing) eH9e( jp + ine) + (Je — ine) 
Solve for 7% Sete a 
eribe — ka(je — inp) — Be(je — tne) (7.2.47) 
ka(je + inp) — Be(je + ine) 
which can be rewritten as 
_ Je—ing 
erie — (4 7 1) pea ee te) (7.2.48) 
ip +4 jeting a 
Jer ine Be — ka (= ) 
which we can also write as 
— ka j= ) 
eride — pike pe chal (7.2.49) 


YY 
Hard sphere By — ka (am 
phase shift 


Phase shifts are useful when a few of them dominate the cross section. This happens 
when ka < 1 with a the range of the potential. So short range and/or low energy. 


1. Angular momentum of the incident particle is L = b- p with b impact parameter and 
p momentum 


L-bp > hlebhk > be (7.2.50) 


wl] es 
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When b > a there is no scattering 


e 
R74 Expect no scattering for 0 > ka (7.2.51) 


Thus only values ¢ < ka contribute to o. 


. Confirm the impact parameter intuition from partial wave expression. Consider the 


free partial wave ~ jo(kr)¥¢o. The impact parameter by of such a wave would be 
estimated to be ¢/k. 


Recall that jg(kr) is a solution of the V = 0 radial equation with angular momentum 
é and energy h?k?/(2m). Setting the effective potential equal to the energy: 


ReO(e+1) Wek? 


= 7.2.52 
2mr? 2m ( ) 

we find that the turning point for the solution is at 
k?r? = &(€+1). (7.2.53) 


Thus we expect je(kr) to be exponentially small beyond the turning point 


kr < Jf@(€+1)~2 (7.2.54) 


confirming that the wave is negligible for r less than the impact parameter ¢/k! 


Pr ©) 


Figure 7.2: Plot of p?J?, (p) 


7.3 Integral scattering equation 


Some useful approximations can be made when we reformulate the time-independent Schrédinger 


equation as an integral equation. These approximations, as opposed to the partial waves 
method, allow us to deal with potentials that are not spherically symmetric. We consider 
the Schrédinger equation 


h? 2 
5777 + V@)] He) = Be), (7.3.1 
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set the energy equal to that of a plane wave of wavenumber & and rewrite the potential in 
terms of a rescaled version U(r) that simplifies the units: 


h2 2 h2 
= _ and V(r) = aru “) (7.3.2) 
then the Schrodinger equation reads 


[-V? + U(r)] o(r) = k(x) (7.3.3) 


rewrite as 
(V? +k?) p(x) = U(r)v(r) (7.3.4) 


This is the equation we want to solve. 
Let us introduce G(r — r’), a Green function for the operator V? + k?, ice. 


(V? +k?) G@—-r') = 59 —-r) (7.3.5) 
Then we claim that any solution of the integral equation 
w(e) = ole) + faer’Ga—r Ue Wwe") (7.3.6) 
where Wo(r) is a solution of the homogeneus equation 
(V? + k*) do(r) =0 727) 


is a solution of equation (7.3.4). Let’s check that this is true 


(V2 +k?) o(r) = (V2 + k?) ferew —r')U(r')v(r’) 


To find G we first recall that 


ce for r #0 as a matter of fact (v2 + k?) aus =0 VrF0 


G(r) ~ 
— for r > 0 as a matter of fact V2 (-z) = 53(r) 
(7.3.9) 
Thus try 
1 etikr 
= poe 7.3.10 
Gat) 4nr ( ) 
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and we are going to refer to G1 as the outgoing wave Green function and to G_ as the 


ingoing wave Green function. 
Let’s verify that this works, write G as a product: G(r) = et**" (-z), so that 


VGa(r) = fee) (-z) 4 etikry? (-=) sf 2 (Ver) 3(-2) 


(7.3.11) 
Recall that 
V=V-V, Vr=-, V«(fA)=VF A+tfV-A (7.3.12) 
Therefore dik 
Verikr = aiee ; V2erikr = (-# ze =) erikr (7.3.13) 
then 
VG4(r) = (—k? 2ik otikr 7 eee + ett 93 (p) 49 ( iketitr *) ( r ) 
i i Arr aii r Arr 
Dike 2 
= =hGs (r) = tk etikr ze 63(r) an tk Eikr 
ir Tr 
= —k’Gi(r) + &(r) Vv (7.3.14) 
We will use 
wo(r) = e'*? and G=G4 (7.3.15) 
Thus, with these choices, (7.3.6) takes the form 
oars i; Br Ga (r—r')U(r wr’) (7.3.16) 
where 
i , 1 etklr—r'| Be 
ae ean ae) (7.3.17) 


We now want to show that this is consistent with our asymptotic expansion for the energy 
eigenstates. For that we can make the following approximations 


For the G, denominator: jr—r'|xer (7.3.18) 


For the G4 numerator: jr—r'|er—n-r (7.3.19) 


where 
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In this way 
1s sje Xe 
Gy(r—-r)= yee eres (7.3.20) 
Thus ; 
tkz 1 3.4 —ikn-r’ ! / en 
wr) =e" + es d’re U(r')wv(r’) - (F321) 


The object in brackets is a function of the unit vector n in the direction of r. This shows that 
the integral equation, through the choice of G, incorporates both the Schrédinger equation 
and the asymptotic conditions. By definition, the object in brackets is f;,(0, ), i.e. 


fi(9,¢) = -— i dre T'U (r')ah(r') . (7.3.22) 


Of course, this does not yet determine f; as the undetermined wavefunction w(r) still 
appears under the integral. The incident wave has the form of e’*i™ with k; the incident 
wave number, |k;| = &. For the outgoing wave we define kg = nk (in the direction of n), 
the scattered wave vector. The expression for ~(r) then becomes 


te) = eter [Ef atte ery etyoey] (7.3.23) 
We will do better with the Born approximation. 
7.3.1 The Born approximation 
Consider the original integral expression: 
w(r) = eT 4 [aves —r')U(r')v(r’) (7.3.24) 


Rewrite by just relabeling r > r’ 
wr’) = eter pariose’ =r" U(r" )y(r"). (7.3.25) 
Now plug (7.3.25) under the integral in (7.3.24) to get 


w(r) = efter | BPr’G (er eens [ d3r'G, (r—r’)U(r’) fae'ose' Ue we) 
(7.3.26) 
Repeat the trick once more to find 


wir) = er a [arent —r’) U(r' eter’ 
+ fereye rue) fare! - 2") Ue" 


+4 [arent _ ruta’) f aee'cy( 7, r”)U(r”) partes" = 0 )U (re )v(r"”) 
(7.3.27) 
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By iterating this procedure we can form an infinite series which schematically looks like 


w=ers Fouelsrs fau fave + fou fou fave +... (7.3.28) 


The approximation in which we keep the first integral in this series and set to zero all others is 
called the first Born approximation. The preparatory work was done in (7.3.23), so we 


have now 
; ‘ ikr 
Born (yp )= _ ik oe =(/ d®r’e —iks-r ‘veye*") ee (7.3.29) 
r 


hence 


£2°™(0,8) = - 7 f are). (7.3.30) 


Here we defined the wave-number transfer K: 


|ki)=|ks], K=ks—k; (7.3.31) 


—>» 


U 


Note that we eliminated the primes on the variable of integration — it is a dummy variable 
after all. The wave-number transfer is the momentum that must be added to the incident 
one to get the scattered one. We call @ the angle between k; and kg (it is the spherical 
angle 6 if k; is along the positive z-axis). 

Note that 


= |K| = 2ksin § (7.3.32) 


In the Born approximation the scattering amplitude f;,(6, ¢) is simply the Fourier transform 
of U(r) evaluated at the momentum transfer K! f;,(0,) captures some information of V(r). 

If we have a central potential V(r) = V(r), we can simplify the expression for the Born 
scattering amplitude further by performing the radial integration. We have 


1 2m 


Born/g = 3 ek yr 
fP"0) =- / PreK V(r), (7.3.33) 


By spherical symmetry this integral just depends on the norm K of the vector K. This is 
why we have a result that only depends on @: while K is a vector that depends on both 0 
and ¢, its magnitude only depends on @. To do the integral think of K fixed and let O be 
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the angle with r 


—y 
K 


oo 1 ; 
JEG) = a 2ndr P| d(cos @)e**" 8 V(r) r/ 
TM JO -1 
@ 
oo —ikr ek : 
22 9 e — 2sin(Kr) 
=" 92). drr*V(r) a = af drr?V (A (7.3.34) 
hence 
fx(@) = af drrV(r) sin(Kr) (7.3.35) 


with K = 2k sin § 
The Born approximation treats the potential as a perturbation of the free particle waves. 
This wave must therefore have kinetic energies larger than the potential. So most naturally, 


this is a good high-energy approximation. 


Example: Yukawa potential Given 


V(r) =V(r) = s— B,u>0 (7.3.36) 
from (7.3.35), one gets 
io) = ls ie dre *" sin(Kr) = — EER ; (7.3.37) 


We can give a graphical representation of the Born series. Two waves reach the desired 
point r. The first is the direct incident wave. The second is a secondary wave originating at 
the scattering “material” at a point r’. The amplitude of a secondary source at r’ is given 
by the value of the incident wave times the density U(r’) of scattering material at r 

In the second figure again an incident wave hits r. The secondary wave now takes 
two steps: the incident wave hits scattering material at r’ which then propagates and hits 
scattering material at r’, from which it travels to point r 
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Figure 7.3: Pictorial view of Born approximation. Wave at r is the sum of the free incident wave at r, 


plus an infinite number of waves coming from the secondary source at r’, induced by the incident 
wave. 


Figure 7.5: Given (0, ¢) and k (~ energy), K is determined. 


Chapter 8 


Identical Particles 


© B. Zwiebach 


Two particles are identical if all their intrinsic properties (mass, spin, charge, magnetic 
moment, etc.) are the same and therefore no experiment can distinguish them. Of course, 
two identical particles can have different momentum, energy, angular momentum. For 
example all electrons are identical, all protons, all neutrons, all hydrogen atoms are identical 
(the possible excitation states viewed as energy, momentum, etc.) 


8.1 Identical particles in Classical Mechanics 


We can assign a labeling of the particles and follow through. Results are labeling indepen- 
dent. 


As shown in Fig.8.2 we have two cases: 

e Case 1: Assume we solve the dynamics and find 
ri(t) =r(t) with r(to) =ro 
ro(t) =r’ (t) with r’(to) =r 


e Case 2: Since the particles are identical the Hamiltonian must make this clear 


A(r1, Pi3¥2,P2) = A(r2, p2sri, Pr) (8.1.1) 
This time, when we solve 
ri(t) =r'(t) with ra Ce ere 
ro(t) = r(t) with r(to) =Yro 


The two descriptions are equivalent. We can follow the particles and the particle that 
started at {ro, po} will be at r(t), while the one that started at {rp, pp} will be at r’(t). 


In conclusion, we choose one labeling, just as if the particles were different. Follow 
through without regard to other possible labeling. 
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Figure 8.1: Example of two different ways of labeling the initial state 


8.2 Identical particles in Quantum Mechanics 


8.2.1 Exchange degeneracy 


If we cannot follow particles once they overlap and/or collide, we can’t know what alternative 
took place. 
Question: How to write kets for the initial and final states? 


Simpler case to see the complications Let there be two spin-1/2 particles. Consider only 
their spin', one is |+), the other |—). Recall tensor product |v;)(1) ® |vj)(2) describing 
particle 1 in state v; and particle 2 in state vj. More briefly |v;)(1) ® |vj)(2) = |vi) @ |v;) 
with the understanding that the first ket is for particle 1 and the second for particle 2. 
What is the state of the two spin 1/2 particles? 


I+)a) @l-)ay or |) ay I+) Qay (8.2.1) 


A priori either one! This, despite the fact that in our conventions for inner products these 
two states are orthogonal! Can we declare these states to be physically equivalent and thus 
solve the ambiguity? No. If the two states above are equivalent we would have to admit 
that even the states 


|b) = a|+) @ |-) + B|-) @ |+) (8.2.2) 


‘in this approximation we could say that they are static and very close to each other, even on top of each 
other 
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O—-> © <— 
i @ 


Figure 8.2: Example of two different processes involving identical particles. In principle, in 
quantum mechanics it is impossible to tell which one happened. 


with |a|? + |8|? = 1 for normalization, are equivalent. This ambiguity in the specification 
of a state of identical particles is called erchange degeneracy. 

What is the probability to find these particles in the |+, x) ® |+, x) state? 

This states is 


Wo) = 3 (Ho + I-)ay) @ (He a5 I-).2)) 


1 
=S(HmeHe + Ha ee + HaeHe + wee) 23) 


the probability is 
2 1 2 
l(volv)| = |5(2 +2) (8.2.4) 


thus a tremendous ambiguity because the chosen values of a, 8 matter! 
Three particle degeneracy; 3 different eigenstates |a), |b), |c) give the following combinations 


|a) (1) ® |b) (2) ® |e)(3y |a) (1) ® le) 2) ® |) (3), 
1b) (1) ® |e)(2) ® |a)(zy , 1b) (1) ® |a)(2) @ le) (ays 
Ic)(1) @ |a)(2) ® |b) a) 5 Ie) (1) @ |) (2) ® |a) a) - 
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8.2.2 Permutation operators 
Two particle systems 
Consider the case when the vector space V relevant to the particles is the same for both 
particles, even though the particles may be distinguishable. Call the particles 1 and 2. 
Consider the state in which particle one is in state u; and particle 2 in state u; 
|245) (1) & |uj)(2) EVEeV (8.2.5) 

Since the particles are possibly distinguishable note that 

|2ui) (1) ® |g) (2) F |ey) (ay @ Lees) (2) - (8.2.6) 


Define P2; the linear operator on V ® V such that 


Pr, [ere ® |uj)(2y} = |Uj) (1) @ [wi (ay - (8.2.7) 


note that 
Po Po = 1, (8.2.8) 


i.e. Po, is its own inverse. Claim 


Pi, = Py, Hermitian (8.2.9) 


Proof: First of all let’s recall that for a generic operator O, the adjoint of O is defined s.t. 
(aJO|8) = (8|Ola) (8.2.10) 
In our case, not writing the subscript labels, 
(up| @ (wel Poult) ® 15) = (ugl @ (uel ([eep) ® |es)) = dp, 5e 
(ui| @ (wel Ph |eue) & lusy) = ((uil © (4j|Porlean) @ Jue) )” = [(eusl @ (ugl (lee) @ led) |” = [6:0854 |” = 84950 


hence 
PlL=Py Vv. (8.2.11) 


Because of (8.2.8) we also have that Py is unitary: 
Pl, Poy = Py Po = 1. (8.2.12) 


Given a generic state |W) it is not clear a priori what would it be its behaviour under 
the action of P21, hence to make our life easier we want to rewrite a generic state |y) in 
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terms of eigenstates of Py. 
We can define two eigenstates of Pj; with the following properties 


P1|Ws) = |ws) Symmetric state 
Pyi\Wa) = —|da) Antisymmetric state 
and two operators Sand A 
4 1 A ~ 1 n 
S= gl + Pa), A= (4 — Pai): (8.2.13) 


Note that $ and A are such that 


a Oe ee, ee : 
Py S = 5 (Pa + Po Poi) = 5 (Pa +1)=8 


Py A= 5(Pa — Py, Px) = (Pa 1)= A 
Therefore, given a generic state |W) we have that 
Py S|) = S|) = S|) is symmetric 
P, Alp) = — Aly) = Alw) is anti-symmetric. 


Because of this, the Hermitian operators S and A are called symmetric/antisymmetric 
projectors. From a mathematical point of view S and A are orthogonal projectors? and 
satisfy 


S=S, A=A, S+A=1, SA=AS=0. (8.2.15) 


Action on operators Let B(n) be an operator acting on the n-th vector space, i.e. 
B(1)|us) a) @ luz) 2) = (Blu), ® |uyz) (2) 


B(2)|us) (1) ® |uz)(ay = |ua)ay ® (Blu)) 


then the action of Py; on B(1) is 
Py B(L) Phy les) ay ® [etj)(2), = Per BCL) Ij) ay les) 2) 
= Pu (Bluj)) © be) @ = lu) ay © (Bley). = BQ) lus) @ |us) 
(8.2.16) 


2A projector P: V > U CV is orthogonal if V = ker P® range P, with ker P | range P. Take v € V, 
then 


v= Pu tuy—Pu=utw (8.2.14) 
rangeP ker P 


where w € ker P, u= Pu € rangeP, then ker P | rangeP because 
(w,u) = (w, Pv) = (Ptw,v) = (Pw,v) = (0,0) = 0 
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hence 


Py B(1) Pi, = B(2) (8.2.17) 


Similarly we have 
Py B(2)Pi, = B(1) (8.2.18) 


and if we consider a generic operator O(1,2), then 


Py, O(1,2)P}, = O(2,1)|. (8.2.19) 


Note that if @(1,2) = O(2, 1) we say O(1, 2) is symmetric. If an operator is symmetric then 


0 = Px O(1,2)P3, — O(1, 2) 

0 = Px O(1, 2) — O(1, 2) Por 

0 = [P21, O(1, 2)] (8.2.20) 
[Po1,O(1,2)] =0 <= 0 is symmetric. (8.2.21) 


N particle systems 


In a N particle system we can define N! permutation operators Py vy» With Ps.) n being 
the identity. For a 3-particle system, for example, the operator Py», acting on a state has 
the effect of 


n—>1 n — th state moved to position 1 
Pigs means p>2 p — th state moved to position 2 
q7>3 q — th state moved to position 3 
e.g. 
Po31| Ui) (1) ® |uj) (2) @ |ue)(3) = |Uz) (1) ®@ lux) 2) ® |ua) (3) (8.2.22) 


You should check that its inverse is Pi. so that 


P21 P312 = 1. (8.2.23) 

More formally we can define a permutation of N numbers by the function @ that maps the 
standard ordered integers 1,...,.N into some arbitrary ordering of them 

a: [1,2,...,N] — [a(1), a(2), ..., a(V)] (8.2.24) 


and associate it with a permutation operator 
Py = Pe(1),0(2), ..0(N) (8.2.25) 


Pylur) (1) ® +++ ® fun) wy = ltaay)ay ® +++ ® Macy) (8.2.26) 
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For example 
P14o|t11)(1) ® |t2)(2) ® |tus)(3) ® [tda)(4) = /tea) (ay @ [41 (2) @ |ta) (3) @ |) (4) (8.2.27) 


or 
P3142|a) (1) ® |b) 2) ® |c)(3) ® |4)(4) = |e) ay @ |) (2) ® |d) (3) ® |b) (4) (8.2.28) 


The set of all permutations of N objects forms the symmetric group Sy and it has N! 
elements. For S3 we have 6 elements or the 6 permutation operators: 


aS cyclic « cyclic « ES cyclic - cyclic « 

F532 el to ogi y. igos a oie aot 

SY —— 
1 these are transpositions, 


a permutation in which only 
2 particles are exchanged 
without affecting the rest 


P,32 is a transposition in which the states of the second and third particles are exchanged 
while the first particle is left unchanged. For transpositions we sometimes use the notation 
where we just indicate the two labels that are being transposed. Those two labels could be 
written in any order without risk of confusion, but we will use ascending order: 


(12) a P 13 
(13) — P354 (8.2.29) 
(23) = Pize 


While all permutations that are transpositions are Hermitian (see the proof for P», that 
easily generalizes), general permutations are not Hermitian. It is intuitively clear that any 
permutation can be written as product of transpositions: any set of integers can be reordered 
into any arbitrary position by transpositions (in fact by using transpositions of consecutive 
labels). The decomposition of a permutation into a product of transpositions is not unique, 
but it is unique (mod 2). Hence we have that every permutation is either even or odd. A 
permutation is said to be even if it is the product of an even number of transpositions, and 
it is said to be odd if it is the product of an odd number of transpositions. 


Since transposition operators are and unitary any permutation is a unitary operator. All 
transpositions are also Hermitian, but an arbitrary product of them is not hermitian be- 
cause the transpositions do not necessarily commute. 


In fact, the Hermitian conjugate of a permutation is its inverse, which is a permutation 
of the same parity. This is clear from writing P, as a product of transpositions P;,: 


ee eas cdl =P,,... P,P, (8.2.30) 


and therefore a 
= P,Pl = 1 (8.2.31) 
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Theorem 8.2.1. The number of even permutations is the same as the number of odd 
permutations 


Proof. Consider the map that multiplies any permutation by (12) from the left (12) : 
Peven + Poaa 80 that if o € Payen then (12)o € Poag. This map is one to one 


(ie= 2a" == 6 Sa, (8.2.32) 


by multiplying from the left by (12), which is the inverse of (12). This map is also surjective 
or onto:for any 8 € Poaa, we have 6 = (12) (12)2. 
ee 


€Peven 


A\B || P3i2 | Posi | (23) | (12) | (18) 
P3190 P31 1 (12) (13) (23) 
Posi) a Pape ay aay a) 
8) G3); | G2) I. a Br | Pa 
(12) | (23) | (13) | Pao | 2 | Poss 
Gay aay Csr eae | 


Table 8.1: A- B matrix for $3. 


Complete symmetrizer and antisymmetrizer 


Permutation operators do not commute, so we can’t find complete basis of states that are 
eigenstates of all permutation operators. It is possible, however, to find some states that 
are simultaneous eigenvectors of all permutation operators. 


Consider N particles, each with the same vector space. Let P, be an arbitrary permutation, 
then 


Symmetric state |¢g) : P,|ws) =|vs) Va 
Antisymmetric state |q,) : Py | qa) = €q|Wa) 


where 


pee fe if Py is an even permutation (8.2.33) 


—1 if P, is an odd permutation 


In the total Hilbert space V°N = V @---@V, we can identify a subspace Sym“ V c V®N 
te YEP 


N 
of symmetric states and a subspace AntiNV c V®% of antisymmetric states. Can we 
construct projectors into such subspaces? 
Yes! 


B 1 A A if . 
S= NT Ss" PB, and A= NI se Cols (8.2.34) 


8.2. IDENTICAL PARTICLES IN QUANTUM MECHANICS 159 


where we sum over all N! permutations. S is called the symmetrizer and A is called the 
antisymmetrizer. 

Claim: §=S' A=Al (8.2.35) 
Hermitian conjugation of Py, gives Py 1 which is even if Py is even and odd if Py is odd. 
Thus Hermitian conjugation just rearranges the sums, leaving them invariant. 
Moreover 


P,, 5 =8P,, =8 (8.2.36) 

P,, A = APoy = €a,A (8.2.37) 
Proof. Note that Py, acting on the list of permutations simply rearranges the list, given 
two permutations P,, # P,, then Pro Py, F Py, Py, hence 


ee oe | a 1 Bs 

P,,S = a0 WI oa Fe = Fy DL PaoPa = aise v (8.2.38) 

No | — ne 

analogously 

Ker ge ; 1 ia ie 1 a oe 

Poo A = Poo a Saree ari So eePaPy= at Se eneaptan Pog ha 

a a a 
=1 
€ ees € A n 
= aT d EatayPayPa = a4 x egPp=€aA Vv (8.2.39) 


S=S8, AM=A, SA=AS=0 (8.2.40) 

Gat No pega a gee: 8.2.41 
= P= oS nes (8.2.41) 
eis poral aay ete ee 9.42 
Saye Q = Wi DL fata — WI ~ WI : = (8.2: ) 
GEO NS, H Gs UR ORR ey Sa 
S= ay DL ta a a Dot = Dae = (8.2.43) 


Since, as explained before there are equal numbers of even and odd permutations, i.e. 


Sota =U! 
Note that F oe : 
S|) €Sym*V_ since P,S$|y) = S|W) Va (8.2.44) 


and analogously : oes ‘ 
Alp) € AntiNV since P, Alp) = €, Aly) Va (8.2.45) 


Hence, they are, as claimed, projectors into the symmetric and antisymmetric subspaces: 


Save" = Sym" V, Arye > Anti y (8.2.46) 
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Example: N =3 For S3 we have 6 elements or the 6 permutation operators: 
Pio3 =1, P32, Posi, Pis2, Pais, Psa 


In this case the symmetrizer and antisymmetrizer operators S' and A are 


S = §(1 + Pao + Posi + Pio + Por + Pyar) (8.2.47) 
A= §(1 + Psi2 + Posi — Pigz — Poi3 — P32) (8.2.48) 

Note that 
S+A= F(1+ Pao + Posi) 41 = Pros (8.2.49) 


This is a manifestation of the fact that in general for N > 2, we have 
Sym’ V @ AntiNV c VON (8.2.50) 


i.e. in principle, the N-particle Hilbert space is not spanned by purely symmetric or anti- 
symmetric states. 
We define O(1,2,...,.N) to be a completly symmetric observable if 


[O(1,2,...,.N), Py] =0 Va (8.2.51) 


8.3. The symmetrization postulate 


In a system with N identical particles the states that are physically realized are not ar- 
bitrary states in V®%, but rather they are totally symmetric (i.e. belong to Sym V), in 
which case the particles are said to be bosons, or they are totally antisymmetric (7.e. belong 
to Anti’ V) in which case they are said to be fermions. 


Comments: 


1. The above is a statement of fact in 3D. Alternative possibilities can happen in worlds 
with 2 spatial dimensions (anyons) 


2. The postulate describes the statistical behaviour of bosons and of fermions 


3. Spin-statistics theorem from Quantum Field Theory shows that bosons are particles 
of integer spins (0, 1, 2, ...) while fermions are particles of half-integer spin (1/2, 3/2 


pee 


4. The symmetrization postulate for elementary particles lead to a definite character, as 
bosons or fermions, for composite particles, which in turn obey the symmetrization 
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postulate. Take for example two hydrogen atoms 


i=) e 
Cy) 
H-alon H- atten 


p= praton sa 2 kann Copa 4) 
O= Moder We foarmen( 9 | 


The system is made by 4 particles and its total wavefunction is U(p,, 1; p2, e2). Since 
the two electrons are identical particles of spin 1/2, the wavefunction must be anti- 
symmetric under the exchange e] © e€9 


V(p1, €2; p2,e1) = —U(pi, €1; pa, 2) - (8.3.1) 


Exactly the same argument applies to the protons, 


WU (po, €1; pi, €2) = —V(p1, e1; pe, €2)- (8.3.2) 


Therefore under the simultaneous exchange 
U (po, €2; pi, €1) = +W (pi, e1; pa, €2) - (8.3.3) 


The exchange in (8.3.3) corresponds to pi p2 and e1 2, an exchange of the t wo 
hydrogen atoms! Since the wavefunction is symmetric under this exchange, (8.3.3) 
shows that the hydrogen atom is a boson! 


5. The symmetrization postulate solves the exchange degeneracy problem. 


Say ju) € V®% represents mathematically a state. Let Viuy = span { P,.|u) Va}. 
Depending on |u) the dimension of Vj, can go from 1 to N!. This dimensionality, if 
different from one, is the degeneracy due to exchange. 

The exchange degeneracy problem (i.e. the ambiguity in finding a representative for 
the physical state in Vj,)) is solved by the symmetrization postulate by showing that 
Vi.) contains, up to scale, a single ket of Sym V and a single ket of Anti’ V. 


Proof. Suppose we have two states |q), |¢") € Vj) that both happen to be symmetric: 
2), |W’) € Sym V. We can write them as 


Iw) = So caPalu) and |) = Soe Palu) (8.3.4) 
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with cy, c, some coefficients. Then, since |7) € Sym V 


|b) = Sb) = SD caPalu) = S > caSPalu) = 5 caS|u) = Su) S~ ca. 


Analogously ; > 
/ / 
) = Slu) > _ ca: 


Hence |w) x |v’), the states are the same up to scale. 


. Building antisymmetric states. 


Constructing the three-particle state Alu) with |u) € V®3 given by 


|u) = |¥) (1) @ Ix) (2) @ |) (3) 


The claim is that the antisymmetric state is constructed by a determinant: 


cr 


= 4) €oFale)(y @1xX)2) @lw)ay = HII We Xe) 


(8.3.5) 


(8.3.6) 


(8.3.7) 


(8.3.8) 


When writing the products in the determinant one must reorder each term to have 
the standard order |-) (1) @ |-)(2) @ |-)(3). You can confirm you get the right answer. 


Now do it generally. Recall the formula for the determinant of a matrix 


det B = Ss" €aBo1),1Ba(2),2 ss8 Ban),N 


Let |w) be a generic state € V@% 


lw) = |w1) (1) |w2) (2) tee lw) (wv) 


Po |w) = |wacay)(1)|Wa(2)) (2) -»- Wacwy) (HN) 


Ala) = 5D GaP) = Hq Do caleac yearn) ey bacon 


If we define a matrix 
wig = |wi) (3) 
then ‘ 
Alw) = NI 7 EaWe( ),2°+-Wo(N),N = Fy det (w) 


(8.3.9) 


(8.3.10) 


(8.3.11) 


(8.3.12) 


(8.3.13) 


(8.3.14) 
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i.e. 
lwi)(a) wid) «+. [wy 
. , [2a lea + [Hedy 
Alo) = aR] | (8.3.15) 
lw) (1) see see wn) i) 


8.4 Occupation numbers 
Consider a system of N identical particle. Basis states in V®% take the form 
5) (1) @ --- (Up) (n) (8.4.1) 
where the one-particle states form an orthonormal basis of V: 
V = span{|uz), |u2), ...} (8.4.2) 


By applying S or A to the full set of states in V@% we obtain all physical states in Sym’ V 
and Anti’ V. But many different states in V®% can give rise to the same state in Sym’ V 
and Anti’ V after the application of the projectors. 

To distinguish basis states in V®% that after application of § or A are linearly indepen- 
dent, define the occupation number. We assign a set of occupation numbers to a basis state 
|-) @-++-@J|-). An occupation number is an integer n; > 0 associated with each vector in V: 


lita). |tt9) » ee [Wess aes (8.4.3) 


We define nz to be the number of times that |u,) appears in the chosen basis state 
|) @---@|-). Thus, by inspection of the state |-) @--- @ |-) we can read all the occu- 
pation numbers nj, 72,---. It should be clear that the action of a permutation operator on 
a basis state in V@% will not change the occupation numbers. 


Two basis states in V@% with the same occupation numbers can be mapped into each 
other by a permutation operator; they lead to the same state in Sym V and to the same 
state (up to a sign) in Anti V. Two basis states in V® with different occupation numbers 
cannot be mapped into each other by a permutation operator. They must lead to different 
states in SymV and to different states in Anti’ V, unless they give zero. 


Given the occupation numbers of a basis state, we denote the associated basis state in 
Sym’ V as follows 


\n1, N2, .+- )s M4 > 0 (8.4.4) 
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Explicitly 
\n1, NQ,..- )s = cS |w1) Sea |w1) ® |w2) ae |w2) ®... (8.4.5) 
ny, times n2 times 


where cg is a constant that is used to give the state unit normalization. More briefly we 
write 
cg 5 |u1)®™ @ |us)?™ @... (8.4.6) 


where |u;)®” is equal to 1 when n; = 0 These states form an orthonormal basis in Sym‘ V: 


\n1, na, x54 NG 


s(n, Nb, Sone \n4, NQ, ++. )s = Ons nt Onan, arse (8.4.7) 
The space Sym’ V relevant to identical bosons is spanned by all the states 


[pe Pedy see) with Sone = (8.4.8) 
k 


The space Anti’ V relevant to identical fermions is spanned by all the states 


|n1, M2,.--)A with So nk =N and nz € {0,1}, (8.4.9) 
k 


since occupation numbers cannot be greater than one (any state with an occupation number 
two or larger is killed by A). We have 


|n1,N2,..-)A = ca Alu,)®™ @ |219) 2”? ®... (8.4.10) 


where cy, is a constant that is used to give the state unit normalization. These states form 
an orthonormal basis in Anti’ V. 


8.5 Particles that live in V@ W 


A particle may have space degrees of freedom, described by a vector space V and spin degree 
of freedom associated with W. Suppose we have a state than of 2 such particles described 
in (V @ W)®? for example 


|b) = vi) (1) ® |wi) (1) ® v5) (2) & |w;)(2) +... (8.5.1) 


This w should belongs either to Sym?(V @ W) or to Anti?(V @ W). The permutation 
operator here that exchange particles 1 and 2 is 


Psaro|) = Pyaa( vi) aye) 2) @10j)(2)e09)@)) = |vj) (1) ®|wy) (1) |v) (2) @|wi) (2) (8.5.2) 


Wanto to express this in terms of Sym?V, Anti?V, Sym?W, Anti?W. 
Why? Because it is possible: for any state 


Ww Ww 
..ja)... |b) = 


€ Sym?W € Anti?W 
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Thus we can assume we work with simultaneous eigenstates of P3214, which exchanges the 
V states, and of Pi432, which exchanges the W states. Note that 


Paro = P3214Pya32 (8.5.4) 


where the order is not important since [Paoia, Pya39] = 0. 
The eigenvalues are 


P3214 | Pia32 | P3a12 
1 1 1 
—l —l a 
1 —l —l 
—l 1 —l 
This means that 
Sym?(V @ W) ~ (Sym?V ® Sym?W) @ (Anti?V @ Anti?W) (8.5.5) 


Anti?(V @ W) ~ (Sym?V @ Anti?W) © (Anti?V @ Sym?W) 
where with ~ we indicate 
vs) (1) ® |wa) (1) @ |vs) (2) @ |wz) 2) & |vs) (1) @ |¥s) a) @ |wa) (2) @ [w5) (gy - (8.5.7) 


The generalization to 2 particle belonging to (U @ V @ W) is simple, for 3 or more particle 
is more complicated. 


Example. Two electrons with spin wavefunction 


W (x1, 771; X1, M2) = (#1, £2) - x(™m1, M2) S.=mh (8.5.8) 


x can be some normalized state in the space spanned by the triplet and the singlet. The 
probability dP to find one electron in d?x, around x, and in d?xzg around xg is 


dP = |$(X1, X2)|?d°x1d°x2 (8.5.9) 
Assume for simplicity that the electrons are non interacting so that the Schrédinger equation 
- v3, + V(x1) - re V(x) U= EV (8.5.10) 


is separable, so there is a solution of the form W4(x1)wpB(x2) with 


[ xleacor =1, [exleacol =1, [Px 0Acovnb0 = aan #0 
(8.5.11) 
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By Schwarz’s inequality |(u,v)| < \/(u, u)./(v, v) we have 
[ex va00bal0) = Ma,t0)] < VICatalVIGa wa] S [loan] <1] (8.5.12) 


But then must build 


b+(X1, X2) = vs (wWa(x1) vB (x2) x= Wa(X2)0B(X1)) (8.5.13) 


with N+ a real normalization constant. 


Take the following combination of ¢ and x: 


P+ + Xsinglet so that the total wavefunction is antisymmetric 


— * Xtriplet any of the 3 states of the triplet 


Again, the probability to find one electron in d?x, around xz and in d®x2 around xg is 


dP: = |¢ (x1, X2)|?d?x1d?x2 


= AE {\waler)vn (xa? + |Wa(xa)ve (x1)? + [WH (x1) a (X2)0B (x2) 0B (x1) \abxrd?xe 
— SS 


V2 
Exchange density 
(8.5.14) 
If we take the case x1 = xg = x, we get 
dPs = Na{ Walaa (x)!? + valoda ol? }ePxrd xa 
dP, = 2Ni\W4(x)vp(x)|?d?x1d?xe (8.5.15) 
dP_=0 (8.5.16) 


Recall that P, is associated with the singlet, while P_ with the triplet. Therefore electrons 
avoid each other in space when they are in the triplet state. In the singlet states there is 
enhanced probability to be at the same point. 

Note that normalization requires 


2 
la : dx? x0 { ha xa)ebp (xa/? + [ba (a) eb (xa)? [DH (reba (Ka) U (Xa) (%a)] f 


= AE {rere on| [oar vaceyonce) f oPxavi0ea)abea)] | 


= N2{1+R[aap-o%p] } 


= N2(1i+ 2 Ny = ——____ 8.5.17 
2(1 + leasl’) Sy eacer: (8.5.17) 
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So 
2 243... 43 
dP, = ———g|a(x) (x) |°d?x1d° x2 (8.5.18) 
1+ |a4p| 
to be compared with 
dPp = |a(x)|? ba (x)/?d?x1d?xo (8.5.19) 


for distinguishable particles. 
Since from (8.5.12) we have |a4p < 1|, then 


dP, >dPp (8.5.20) 


% 


NN Re 


Assume ~4(x) is nonzero only in a region Ry, Wg(x) is nonzero only in a region Rg and 
RaQ Rp =0, then 


QAB = [ xvacounte =0 (8.5.21) 


since w,4 requires x € Ry and wp requires x € Rg. Therefore in this case Ny = 1. 


Then the probability to find an electron in d?x, around x; € Ry and another in d°x2g 
around xg € Rp is 


dPL = {ales de al? + Whafrea)abetocr)? £ 29 [0% Or bata U5 (Xa the ocr]] dr d xa 
= |ba(x)?|Wp(x)/?d?x1d°x2 = dPp (8.5.22) 


it is the probability density for distinguishable particles. 
Therefore, there is no need to symmetrize or antisymmetrize the wavefunction of non over- 
lapping localized particles. 


8.6 Counting states and distributions 
E; energy of the i-th level, d; degeneracy of the i-th level. Assume we have N particles 


and we want to place Nj particles in level 1, ..., Nj; particles in level i (F;, d;). Let’s call 
Q(Ni, No,...) the number of ways to do this. 
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8.6.1 Distinguishable particles (Maxwell Boltzman) 


‘ , : N! ; ‘ ; ; 
Split N into Ny, No,... in Mino Ways (imagine putting them on a line... ..., after 
Ny No 
N, 


drawing one ball at a time). Placing N; particles in d; slots gives (d;)‘’’ ways, therefore 


(d 


.) Ni 
- Maxwell Boltzman 


Q(M, No,...) = NT 


8.6.2 Identical fermions 


Splitting the N states into groups for identical particles can only be done in one way 


ra an Ni < dj 
how many ways to place them? 
d; choose N; = di 
; choose N; Mild; — Np! 
=> Q(Ni, N2,...)=]] di 
pee" TL Nd — Ni)! 
8.6.3 Identical bosons 
=o 7 ae N; particles! 
N; balls and (d; — 1) bars, 
for example: o-\e\le-- [ieee ea 


How many ways to order the N; + d; — 1 objects? (N; + d; — 1)! but we must divide by the 
irrelevant permutations, hence 


QML, Na...) =T] aot (8.6.1) 


This is for bosons: 

Q(Ni, No,...) counts the number of ways to have N particles and fixed energy E = 
>>, NiE;. Want to find the values of Ni, No,... that are most likely. 

Want to maximize Q(N,, No,...) under the constraint that N = 5°, N; and E = 0, Ni Ej. 
Maximizing Q is the same as maximizing InQ and using lagrange multipliers 


(Ni, No,...) =InQ(N1,Na,...) +a(N — 50 Ni) + B(E - S° NiE;) (8.6.2) 
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Do the fermion one, using Qpr from (8.6.1) we have 


InQr = 5— Ind}! — In Nj! — In(d — Ni)! (8.6.3) 


Since all these quantities d;, N; are large we can use the Stirling approximation 


Inn! ey nlInn-—n (8.6.4) 
so that 
InQr = 5— djlndj—&—N;j ln Nj+D%—(di—N;) In(di— Nj) +i = S© dj In dj—Nj In Nj—(d;—Nj) n(di—N) 
(8.6.5) 
Minimizing w.r.t. to N; without constraints, 
na InQ@r =—-mN,-J+In(d; — Ni) +7 =In (54) (8.6.6) 


so that, adding the lagrange multipliers, we have 


Of | d; — N; d;— Ni _a+ee, - a+BE; ) 
So = 1n( nN, ) a BE; = “=e = d=Nil(e = 


d; 
a and § can be calculated using the equations 57, N; = N and }°, NiE; = E: 
T 1 
a= eee : B= kT definition of temperature (8.6.8) 
then 
d; 
e *B? +41 
The expected occupation number n ~ a for a single state is 
E-y(T) : 
n= (« kp 4 1) Fermi-Dirac distribution 
For bosons we have 
E-w(T) =k 
n= (« kpT i) Bose-Einstein distribution 


Since n > 0 we need F > yp for all energy levels w < E; Vi. For an ideal gas u(T) < 0 VT, 
ie. a>0 
For fermions 
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polo) & 


Figure 8.3: Occupation number for a state as a function of the energy for a system of identical 
fermions in the T— 0 limit. u(Z’'= 0) = Epis called the Fermi energy. 


Figure 8.4: Chemical potential w as a function of the temperature for a system of identical 
bosons. Tc, the critical temperature is defined to be the temperature such that pu(Tc) = 0 
and it’s the temperature for Bose-Einstein condensation 


gle 


Ve 


Figure 8.5: Chemical potential yw as a function of the temperature for a system of identical 
fermions. 


Aside from statistical mechanics 


OE 


= aN keeping constant entropy may require lowering the energy _ (8.6.10) 
SV 


m 


Suppose we add a particle with no energy to the system. S will increase (more ways to 
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divide up the total energy) for this not to happen, must reduce the energy. 


dE(V,S) =TdS — PdV + pdN 


= d(T'S) — SdT — PdV (8.6.11) 
d(E —TS) = —SdT — PaV + dN 
e~_-——’ 
F(T,V)=E-TS. (8.6.12) 


p is an intensive quantity, and in terms of F’ we have 


_ OF Adding a particle changes (8 6 13) 
= ON the energy and the entropy oe 
TV 


